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Preface 


The present book is the first part of our three-part 
textbook “Higher Mathematics”. We deal here with the 
fundamentals of the theory of determinants, the elements 
of the theory of matrices, the theory of systems of linear 
equations, and vector algebra. The book is also intended 
to introduce its readers to the basic aspects of linear 
algebra: linear operators, orthogonal transformations, 
self-adjoint operators, the quadratic form and reducing 
it to the canonical form. Elements of analytical geometry 
(the straight line, the plane, the straight line in space, 
and second-order curves and surfaces) are also included. 

As arule, our arguments are accompanied by exhaus- 
tive proofs. The material is introduced so that the proofs 
for the general n-dimensional case may be omitted without 
the loss of either the formulation of the statement or 
a detailed explanation of how the subject matter stands in 
the two- and three-dimensional cases. 

The canonical forms of second-order curves and sur- 
faces are treated briefly in this book since it is assumed 
that they will be studied additionally by solving parti- 
cular problems using the methods of mathematical analy- 
sis. The quadratic form is dealt with by the methods of 
mathematical, or, if more convenient, by functional 
analysis. 

Although we have mentioned that this is the first book 
in our series, its material is closely interwoven with 
that of the second book dedicated to differential and 
integral calculus. The third book will consider differen- 
tial equations, multiple integrals, series, and the theory 
of functions of a complex variable. Being complementary 
books of a single series entitled “Higher Mathematics”, 
at the same time they may be regarded as independent edu- 
cational aids for students of higher engineering insti- 


tutions, 
The authors 


Sec. 1. Second-order Determinants 


Let there be given numbers aj, @,, ,, b,. They define 
the number a,b. — a,b, which is called the determinant 
of the second order and is written in the following way: 


Qa, as 


by by = ayb,— Debye (1) 


The numbers a,, d,, b;, b, are termed elements of the 
determinant. Considering determinant (1), we distinguish 
the first row a,, a, and the second row b,, bz, the first 
column a,, b,; and the second column ay, bo. 

One can easily verify the following properties of deter- 
minanis. 

The value of a determinant: 

(a) does not change if each of the rows is substituted by 
a column of the same position number: 


a, Qs ay by ; 
b, by 7 Qe, Dy 
(b) changes sign if its rows or columns are interchanged: 
A, as by be 
by by i Qa, Ag : 
Qa, a, G2 44. 
b, bo} |B, BI’ 


(c) increases k times if the elements of any of its colunns 
or rows are increased k times. For instance: 


ka, b, 
ka, bo| — 


a, b, 


As b, 


i.e. a common factor of all the elements of one row (or co- 
lumn) may be taken outside the sign of the determinant, 
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(d) is equal to zero if the elements of a certain column 
or row are equal to zero. For instance: 


0 0 


(e) is equal to zero if the determinant has two identical 
rows or columns: 


ay a 
= 140, — 44a, =0. 


Ay QA» 


Introduced below are determinants of the third and, in 
general, of the nth order. For them the above listed 
properties hold true. 


Sec. 2. Determinants of the Third 
and nth Order 


The number 
A = Gy1Og9033 + Ay9hg3Hg1 + Ay 3Mq123¢ 
— 13890431 — @y1493439 — Qy2801033) (1) 
written in the form 
Qi, Ayn G43 
A=]/d2y Ago Aes}, (2) 
231 Ago 33 


is called the determinant of the third order. 

In determinant (2) we distinguish the first, second, and 
third rows, as well as the first, second, and third columns. 
The number a;; is called the element of the determinant 
in row 4: and column /, where kis the number of the row and 
Lis the number of the column. The element a, is said to 
be found at the intersection of the kth row and /th column. 

The number of rows (or columns) is the order of the de- 
terminant. The elements a@,,, @y., @g3 form the principal 
(or leading) diagonal of the determinant, and the elements 
213, Ago, Ag; form the secondary diagonal. In other words: 
the diagonal from the upper left corner to the lower right 


Sec. 2. Determinants of the Third and nth Order 9 


corner (on which the elements aj, @g9, @33 are located) 
is called the principal diagonal of determinant (2). 
The structure of expression (1) is rather simple. This 
is a number expressed in terms of elements a;; and com- 
puted according to Sarrus’ rule. Shown below (see Fig. 1) 
is a Sarrus table compiled from the elements of deter- 
minant (2) by adding the first and second columns of the 


~~ > an vi 


> 
¢ A390 a Qs, a | 
go y 3 J \ an XN 


Fig. 4. 


determinant. As is obvious from the figure, we have to 
take all possible products of the elements “crossed out” 
by the straight lines. In doing so, the three products 
indicated by the lines parallel to the principal diagonal 
should be taken with the plus sign, and the remaining 
three products corresponding to the lines parallel to the 
secondary diagonal are taken with the minus sign. This 
is just Sarrus’ rule for determining the signs of terms in 
the expansion of a determinant. 

Every product preceded by a certain sign is called the 
term of determinant (2). Among the elements entering 
into these products there are “representatives” from each 
row and from each column. In each term these elements 
can be arranged in the increasing order of the first index, 
i.e. the index of the row they belong to. This is just done 
in sum (1). As far as the numbers of the columns to which 
these elements belong are concerned, their arrangements 
are given below: 

123 
23 4, (3) 
1 2 


Qo 
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2, A 
oD 2, (4) 


Obviously, these are all possible permutations of the dig- 
its 1, 2, 3. The permutation 


1, 2, 3 (5) 


will be called the main permutation. 

We speak of a transposition of two certain elements in 
a permutation if their places are interchanged. After this 
transposition the permutation turns into another permu- 
tation which, in its turn, after a new transposition yields 
a third permutation (possibly, the first one). For instance, 
the permutation 


3, 2, 4 (6) 


is obtained by transposing the first and third elements of 
permutation (5), and the permutation 


2, 3, 1 (7) 


by transposing the first and second elements of permuta- 
tion (6). 

It is important to note that if a certain permutation is 
obtained from the main one via N transpositions and if 
the same permutation is obtained in some other way from 
the main transposition by means of NV, transpositions, 
then both numbers N and JN, are simultaneously either 
even, or odd. The permutation of the digits 1, 2, 3 is said 
to be even (odd) if it is obtained from the main permutation 
via an even (odd) number of transpositions. 

Let there be given a permutation j = (jj, je, js), 
where j,, je, jg are the numbers 1, 2, 3 arranged in 
a certain order. Let us denote by ¢ (j) the number of trans- 
positions by means of which it is possible to get this per- 
mutation from the main one. Then the permutation j 
is even (odd) if ¢(j) is an even (odd) number. 

Permutations (3) are even, and (4) are odd. 

According to what has been said, we may give another 
(equivalent) definition of a third-order determinant, 
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The determinant of the third order (2) is defined as a num- 
ber A equal to the sum 


A= 3) (— 1)" a43,49;,083, (8) 
Ri 


of products of the form (—1)"Pa,;,a2;,a3;,, Where j = (j4, Jos 
js) are all possible different permutations derived from 
the main permutation 1, 2, 3. 

This definition is generalized for determinants of the 
nth order (n = 1, 2, 

The determinant of the nth order is a number written 
in the form 


| Bog «+e Bon (9) 


- e# ee e® ee ae e 


Dic cia Ogi 


and evaluated by the given numbers a;, (elements of 
the determinant) according to the following law: A is 
the sum 


4G 
A= > (— 1) A435 a95, wae Qnjn 
3 


extended to all possible permutations j = (j,, ..., jn). 
The number ¢ (/) equals the number of transpositions which 
has to be done in order to pass from the main permutation 
1, 2, ..., n to the permutation j = (j,;, ..., j,). The 
product (—1)'a1;, ... dpj, is called the term of a deter- 
minant. 

Determinants of the nth order possess the properties (a), 
(b), (c), (d), and (e) discussed in the preceding section. 

Proof. (a) After certain rows of a determinant are re- 
placed by the corresponding columns, the row numbers 
will be denoted by the second indices. For instance, for 
third-order determinant (2) we shall have 


Ay, Ag Ase 
G42 Fog Ugg | = Aq4AonA33 TF Ag4A30013 1 A34Aq4Ao3 
1443 Gog Ag3 
— 3499043 — 0443993 — Ag404,033 = A. 
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In the general case, the general term of the new determi- 
nant will be written as follows 


Me 
(— 1) @) 0},14;,2 -.- Ajnne 


Now we are going to order the factors of the product 
04,14j,2 - - - Ajnn With respect to the first index, i.e. we 
pass over from the permutation j = (j,, ..., jn) to the 
main permutation 1, 2, ..., m. In doing so, we have to 
accomplish ¢ (j) transpositions. Then the main permuta- 
tion of the second indices will turn into a certain permu- 
tation i = (i,, ..., %,) and the number ?¢ (é) will be of 
the same parity as the number ?# (j). Thus, 


t(#) t(i 
el)? age, casas 


It is not difficult to see that to different permutations 
ji, - ++, Jn there correspond different permutations i,, .. . 
.., £,. But then 


| 9 AG 
2 (1) ays... Ginn = dy (— 1) aay, --. nin =A. 


J 4d 


(b) Let us interchange, for instance, the first and third 
rows of third-order determinant (2). The newly obtained 
determinant denoted by A’ will be equal to 


A344 432 33 
: we t(3) 
A — Qos oo Qo 3 es s (— 1) 435,925,481), 
Qyy Aig 43 J 


(3) 
= S$) (—1)"" 213,423,035, 
j 


a t(7’) 2a) 
—- > (= 1) 245,423,435, = —A, 
j’=(3s, Ja Js) 


since the permutation j = (j,, Je, js) differs from the per- 
mutation j’ = (js, je, ji) by one transposition. 

(c) Multiplication of some row (column) of a determi- 
nant by number & is reduced to multiplication of all its 
terms by &, since every term contains one element of the 
indicated row (column). But then the sum of the terms 
will be multiplied by k, 
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(d) A determinant, the elements of some column or row 
of which are equal to zero, is zero, since all of its terms 
are, obviously, equal to zero. 

(ec) A determinant with two identical rows or two iden- 
tical columns is equal to zero. This follows from property 
(b) (A’ = —A, A’ = A, where A = 0). 

In determinant (9) of the nth order let us delete the 
ith row and kth column. The remaining expression yields 
a determinant M,, of the (n — 1)st order which is called 
the minor of the element a;,. And the quantity 


Aim = (—1)'**Mi, 


is called the cofactor of the element a;,. 

Property (f). The sum of the products of elements a;, 
of some row (column) of a determinant by the cofactors of 
these elements is equal to the value of the determinant: 


A= di andr (i=1,...,n), (10) 
A= di andr (k=1,...,n). (10’) 


Let us prove this property for a third-order determinant 
with respect to the third row. We have 


A34A34 + 3A 30 + 233433 
= Qjyn 243 iy Ay A4q 42 
= as, 3 
Boo Ao3 Aoy 3 Go, Age, 


= gy (AyoHe3 — AygMe2) + Ase (413%21 — 211493) 
+ Ag (211222 — Gy94q1) = A. 
Sum (10) is called the expansion of the determinant in 
terms of the elements of the ith row, and sum (410’) the 
expansion of the determinant in terms of the elements of 
the kth column. 
Example 1. If the determinant A (see (9)) a.; = ag, = 
.. =4n, = 0, then A = a,,Ay,, i.e. evaluation of this 
determinant is reduced to evaluation of one of its cofac- 
tors, i.e. of a determinant of the (nm — 1)st order. 
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Example 2. If all the elements of A standing below 
(above) the principal diagonal of A are equal to zero 
(az, = 9, if ko>l (kK< 0d, then A = ayo. ... Ayn. 
This follows from the preceding example. 

Property (g). The sum of products of the elements aj, 
of some row (column) of determinant by the corresponding 
cofactors of the elements of another row (column) is equal 
to zero: 


[4 


} TipAm= dy ApiAny = 0 (41) 


Ps 
tt 


(i A fj; i, j= 1, ar) n). 

Indeed, let us draw our attention to the first sum. This 
sum is independent of the elements of the jth row. Let 
us now replace the elements of the jth row by the cor- 
responding elements of the ith row. This substitution 
leaves the sum under consideration unaltered. Meanwhile, 
it can be regarded as the expansion of the new determi- 
nant in terms of the elements of the jth row, and as such, 
it then equals the value of the new determinant. But the 
latter, by property (e), is zero, since it has two identical 
rows (ith and jth). 

Property (h). Let there be given two nth-order determinanis 
A, and A, in which all the rows (columns) are identical, 
except for a certain one. The sum of such determinants is 
equal to a determinant A of the nth order in which the indi- 
cated row (column) consists of the sum of the corresponding 
elements of this row (column) of the determinants A, and 
A,. For instance, 


Ay Q4,n-4 Fn Ay, Qin-1 in 

A, +- As == ee «©§ © © © © @© © @ @ © +- *> e© e« e® » @© © #@ @ @ 
ani Qn, nt Onn Qn4 an, nes Ons 

Ay G4, n4 (Bin +O 4p) 


Ani-ee On, n-s (@nn+ brn) 
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Indeed, expanding the given determinants in terms of 
the elements of the nth column, we get 


A, +A2= 2 OrnAnn + 2, Orn Ann 


(@zn + ban) Ann =A. 


I 
iM 3 


Property (i). Jf to the elements of some row (column) 
we add the corresponding elements of another row (column) 
multiplied by number k, then the value of a determinant 
remains unchanged. For instance, 


Qi, +++ Qyn4 Gnthay 
ani Qn, n-t1 2nn + kant 
aq ees Lin Qa4 eee a4, hn—i Qyy 
a eee ee ETE lee arco. Ss Me Rees 
Aan, eee Ann any eae an, n-1 Aan4 


=| an |+h-0=] aa] 


by virtue of properties (h), (c), and (e). 

Being properly applied, this property reduced evalua- 
tion of the given determinant to evaluation of a determi- 
nant of a lower order. 

Example 3. 


145/ 4 4 5] |4 4 5 
23 4/=10 —5 —9l=|0 —5 —9 
8414] 18 41 #4! jo —31 —39 

—5 

= 215.99 96-3728 4. 

—34 en Be 207 me : 
Example 4. 
24114 01 O 
(32)—/-5 3 -4/—=-|7? “3l=|_° |= 17 


146 —7 4 2 
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Example 5. The determinant 


1 Gy Oe wee. al 
w—1 
ee nn 
1 a, w@ an} 
generated by the numbers a, @, ..., @, is called a 


power determinant or Vandermonde determinant*. 

This determinant is equal to zero if any two numbers 
a; and a, are equal to each other. If all a;’s ave different, 
then 

An = (@n — 9) (Qn — 44) «. . (@g— G4) 
* (An — Gg) (An-4 — Gq) .. « (Ag —~ Ge) 
* (a, — An-2) (On4 a Ono) : 
* (Bn — Bn_4)- (12) 
Indeed, for n=2 


1 ay 


A,= = dz — 44, 


1 a 


i.e. formula (12) is true. Let us assume that formula (12) 
also holds true for n = k — 1, and prove that it is true 
for n = k. In doing so, we shall take advantage of proper- 
ties (i) and (c) of determinants. Multiplying the (k — 1)th 
column in the determinant A; by @, and subtracting it 
from the kth column, multiplying the (4 — 2)th column 
also by a, and subtracting it from the (& — 1)th column, 
and so on, we then obtain: 


1 0 0 ae 0 


2 
Aye 1 @y—@, Gy (Q@g—ay) --. 2°" (@g — 4%) 
k 
2 
1 @y—@, a, (@,—a@,) --- a, ~ (Qp—4) 


* Vandermonde, Alexandre Théophile (1735-1796). French 
algebraist. Gave first logical exposition of the theory of determi- 
nants, 
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4 Qo az? 
1a Ae 
a (a, — 44) (Gp-4 — @y) eee (dg — a) 3 a 


4 ap ee an 


The latter is also a Vandermonde determinant of order 
(k — 1), generated by the numbers a,, ..., a,, therefore, 
by the assumption, we have 


Ay = (@y — 44) (€p-4 — 44) - - - (€2 — 4) - 
*(A_ — 4g) (Ap4— a) - . . (2g — Aa) « 
* (Ay — Ap~2) (Ap-4 — @p~2) ° 
-(@;, — Gps). 
Thus, by the method of mathematical induction, for- 


mula (12) holds true for any n> 2. 
Property (j) Let 


Ay=|5n1|, Ao= | aqui |. 
The product 


AyAo=| p2 by jay |= A, 


i.e. the product of two nth-order determinants with the ele- 
ments by, App is also a determinant of the nth order with 
the elements 


n 
Vu = za bp j0j1- 
j= 


Thus, the element ,; belonging to the Ath row and 
lth column of the determinant A is said to be equal to the 
product of the kth row of the determinant A, by the lth column 
off{the determinant A,. Actually, it is the sum of products 
of the elements of the Ath row of the determinant A, 
by the corresponding elements of the /th column of the 
determinant Ag. 

Since in the determinants A, and A, rows and columns 
may be interchanged, obviously, the elements y,; of the 


2—01496 
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product A can also be constructed by taking the product 
of the kth row of A, by the ith row of A,, or the product 
of the kth column of A, by /th column of Ag, or of the 
Ath column by the /th row. 

Proof. Let us make sure that this property is correct 
by considering a particular example of second-order deter- 
minants: 


Oy Oe 
Do, be» 


Aq, Aye 
A, = 


a=] V4e 
Vo1 Yee 


9 As —S ’ ‘ 
Ao, Aa2 


where 
Var = Oya, + 91221, Vag = Oye + Dy 2de2, 
Yer = Dei@y1 + Bo2G01, Yan = Oa18ye + Oa2doo. 
By properties (h), (c), and (e), we have 
DyoGng 944Q4o + DyrAne 
Boole, Da4@42 + Da2Qee 
bis bas big Bi; 
bay Oe boo Das 
Dyn Ous 
Doo Ong 


BysQ44 b44Q42 = By2A20 


bo 4Qyy  Gg4Q qq + DoeGoe 


by Ou 


b b + G42Qe4 
a4 Vos 


= 4444842 + A449 


Dye O42 
bee Do» 


H+ g4Q qo +O = AyyAgnAy — A ynGasAy 


+ Qe4Ao9 = B4yAyo°O + Ay Aaah + AyoQo4 


= Ay (@44@q — G49@o4) = AyAg. 


In the general case of nth-order determinants it is 
possible to write 


n nN n 
NI 
= A4s,9811 » 13,9542 oes pa Bis,0sin 
831 ss=1 $,=1 
n n n 
A — > G25,0s,4 ey 2;,05,2 oe 8 > 49s,0s.n 
s,=1 8,==1 8 5=1 
mn n 
>; OnsnOsn AnsnPsn2 ere >» OnspPs_n 
sn=1 sn=i1 sn=i 
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Ds4 b.,2 Dia 
. . 5 b b 
1 s 2 eee 
= > >, coe py Qis,42s8, «++ Ansn 82 a Sgt 
sy=1 s8,=—1 sn=1 * © © © ep & 8 © ee 8 8 
Bsn Bsn eee Dsan 
t(s) 
= >; Q45,22s, eee ansn (— 1) Ay = /A,A,. 
s= (51, 8p, eo aeys Sn) 


In computing individual elements of A we have the right 
nr 


to choose any summation index s (yz; = >} @ngs1), but 
1 


s= 

for the further reasoning it is convenient to take s, as 
such an index for the first row of A, s, for the second row, 
and so on. The second eae ey. is due EO properties (h) 


and (c), the multiple sum s S ae s being extend- 
Sy=1 s,=1 sn=i 
ed to all possible permutations (Sy, So . +» S,), where 


1<s;< n. But if in some system (s,, S,, ..., S,) two 
components s; and s; are equal to each other (s; = s;, 
i= j), then the neighbouring determinant | 6,,, | = 0 
Therefore, actually, in the multiple sum we may leave 
only the terms corresponding to different permutations 
(S,, ..., &,) from natural numbers (1, ..., m). And, 
obviously, it will turn out that the corresponding deter- 
minant 
| Bayt | = (AYA. 


Sec. 3. Matrices 


A rectangular array of numbers a,; of the form 


qe nee Hy 
=( \=tayt=(ed, 


Cnt: «a ‘Opn 
(4) 


consisting of m rows and n columns is called the matriz. 
The numbers a;; are called its elements. If m = n, then 
we have a square matrix of the nth order or dimension. 
In other words, a square matrix is a matrix for which 
the number of rows is equal to the number of columns. 


28 


iy +e An 


A= 


Ami eee Amn 
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The order of a square matrix is simply the number of 
rows (columns) in the matrix. 

If a second matrix is given: B = |{ §;; || with elements 
6;; also consisting of m rows and 7 columns, then it is 
regarded as equal to the matrix A, if and only if the 
corresponding elements of both matrices are equal (a,;; = 
= f,,;). In this case we write A = B. The matrix || @;; || 
is not a number—this is an array. But for a square mat- 
rix it is possible to consider the number | a;; | which is 
a determinant generated by this matrix. 

Let & be a natural number not exceeding m and n 
(k <m, n). Let us now cross out some & columns and k 
rows in array (4). Then the elements a;, found at the 


intersection of the deleted columns and rows form a square 
matrix which generates a determinant of order k. The 
.determinant thus obtained is called the kth-order determi- 
nant generated by matrix A. 

The greatest natural number & for which there exists 
a nonzero determinant of the Ath order generated by 
matrix A is called the rank of matrix A (see Note 2 of the 
next section). 

If in matrix A its rows and columns are interchanged 
(leaving the same numbers), then we obtain the following 
matrix 


x 
ee e@ @ © @ @ 4 80 @ 


AY = 


which is called a conjugate matriz or a transpose of matriz A. 

Matrices of one and the same dimension, i.e. containing 
one and the same number of rows and columns, can be 
added. The sum of two such matrices A = || a,j; || and 
B= || Bi; || is a matrix C = || y,; |], whose elements 
are equal to the sum of the corresponding elements of the 
matrices A and B: yj; = a3 + Byy. Symbolically, we 
shall write this fact in the following way: 


At+B=C. 
It is eaSy to see that 
A+B=8-+4, 
(A+ B)+C0=A+(B+0C). 
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The product of the number i by matrix A (or the product 
of matrix A by the number A) is defined as a matrix whose 
elements are equal to the product of the number A by 
the corresponding elements of matrix 4. Hence, AA = AA. 

Example. Let 


1 0 2 201 
A=(, 3 Ae B=(; -, 
Find the matrix 1A + wB. 


On the basis of definition of the sum of matrices and 
the rule for multiplying a matrix by a number, we have 


7 0 2A Qu 0 ) 

NA = 9 = ? 
& 3A 7 ad bs up 

M+2p 0 2A+u\ 

VA +B = 
a La Shh A+B 


Sec. 4. Systems of Linear Equations. 
The Kronecker-Capelli Theory * 


An arbitrary system of n numbers (z,, ..., 2p) will be 
called an n-dimensional vector. We shall denote it by 
one bold-face letter x: 


: Se ne ay a 
The numbers z; (jf = 1, ..., ”) are called components 
of the vector x. The vector 
0 = (0, ..., 0) 
is termed a zero vector. 
Let us be given a system of n linear equations in n 
unknowns 
AyyLy oes + An tn = Ys, 
Se ach Ai ec etc ec dee (1) 
AnyXy +--+ + AnnItn = Yn- 


* Kronecker, Leopold (1823-1891). German algebraist and 
algebraic number theorist. Capelli, A. (1855-1910). Italian mathe- 
matician, 
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The numbers a,; called the coefficients of system (1) 
are specified. We shall also say that system (1) is defined 
by a matrix of its coefficients: 


Lig eee Qin 
A=lay:|}=]} ---- - - ; (2) 
Gni+++ ann 


We shall be interested in the problem of solvability of 
system (1) for each vector (system of numbers) 


Y= (Yr ++ +> Yn): 
A system of numbers (vector) 
KX = (%, -~ +) Zp) 


is called the solution of system of equations (1) if the num- 
bers z; satisfy these equations. 

Theorem 1. /f the determinant of system (1) is not 
equal to zero: 


A= |ay | ~ 9, 
then system (1) has a unique solution for any vector y which 
is computed by Cramer’s formulas* 
t,;= N/A (G=1,..., 7), (3) 
where A? is a determinant obtained from the determinant A 


by replacing in it the numbers of the jth column by the 
respective numbers Yj, .-.-; Yn: 


Qyy4 + -- Ay fea Ya 4, f4t >> Bin 


IN Se ies ah meee ds Ree Ee a ate Acre ok (4) 
an4 Qn, j-1 Yn @n, j+i ann 
Hence, 
TN 
ry= > Sy Asi¥s (ids ey 2), (3’) 
s=1 


where A,;is the cofactor of the element a,;in the determi- 
nant A. Ke 


* Cramer, Gabriel (1704-1752). Swiss mathematician and phy- 
sicist, | | a 
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Proof. In order to find the unknown number z,, mul- 
tiply the first equation of system (1) by the cofactor A,,, 
the second equation by A.,, ..., the nth by Ans, and add 
all the equations of the given system. Then, taking into 
consideration that 


nT nT 
p2 Api X,Apy= Ly 2 AnyApy = 2,A 


and 
Tt 


pa) Ap j0j Any = Fj 2 AnjAny=2j-O0=0 (Jj 1), 


we get 7,A=A!, where 


s=i Yn Gneg +--+ Ann 


a ct ae since, by hypothesis, A 0, we have x, = 
== NA 

In the general case with an arbitrary j/, we multiply 
the first equation of system (1) by A,;, the second by 
Ao; ..., the nth by A,;, add these equations, and, by 
virtue of determinant properties (f) and (g), finally get 
the equation 


1 k=1 
1.e. 
x; = A’, 
where 
n Qyy ees By, 5g Ya 4, fa +e Aan 
Aj= > Y,Arj= ee a ee ea ee ee ee 
k=1 ant ee © an, j-1 Yn an, iti eee Ann 


Hence, by the fact that A =4 0, there follows equality (8). 
A system of equations of the form 


yyy -T cae + Ayn Lp = 0, 
Os aha: WE Et a Mae i eed Ss (9) 


AnjXy + 6. + Anrt, =O d 
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is called homogeneous. It is a particular case of system (1) 
fory,; =...=y, = 0. It is obvious that the zero vector 


C—O) og os ee SO 


satisfies homogeneous system (5). But it may happen that 
homogeneous system (5) is satisfied by a nonzero vector 
K = (4, ..., Lp), i.e. by a vector having at least one 
component zx; =4 0. This vector is called a nontrivial solu- 
tion of homogeneous system (5), and zero vector is there- 
fore called a trivial solution of homogeneous system (5). 

Theorem 2. Jf the determinant A of homogeneous system 
(5) is not zero (A = 0), then this system has only a trivial 
solution. 

Indeed, by property (d), all the determinants A’ = 0 
(see (4)), therefore, by virtue of equalities (3), x; = 0 

i Pee ee oe 

Theorem 3. /f system of equations (5) has a nontrivial 
solution, then its determinant A is necessarily equal to zero 
(A = 0). | 

Indeed, if A were not equal to zero, then, by Theorem 
2, system (5) would have only a trivial solution. 

In the aforegoing paragraphs we investigated linear 
system (1) for the case when its determinant A = 0. 
In this case it was shown (Theorem 1) that system (1) has 
for any right-hand member of y = (y,, ..., Yn) a unique 
solution which is computed by formulas (38). 

We now pass over to the case when the determinant of 
this system is zero (A = 0). We shall assume that at 
least one element of matrix A (see (2)) is nonzero and 
denote rank A by k (k = rank A). Thus, 1<k <n. 

Our purpose is to prove the following rules (they were 
formulated and proved explicitly by Kronecker and 
Capelli). 

If we want to solve system (1) for which it is known 
that the rank of the matrix A of its coefficients is equal 
to k, we must find out the rank of the augmented matriz 


An, eee Ann Yn 
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obtained by adding to A the column 


Ya Ys 
Yo tt Yo 
Yn Yn 


(1) If rank B is greater than rank A (rank B > rank 
A =k), then system (4) has no solution at all. It is 
contradictory, since there is no vector x = (2, ..., &,), 
satisfying simultaneously all equations (1). 

(2) If rank B equals rank A (rank B = rank A = hk), 
then system (1) has solutions. In order to find them, we 
must choose from system (1) some / equations the matrix 
of coefficients of which has a rank k, and solve these k 
equations. This system of k equations will have an infinite 
get of solutions, but they can be written in a visible man- 
ner. 

In this case any solution of the & chosen equations is 
automatically a solution of the remaining n — k equa- 
tions of system (1). 

Rules (1) and (2) exhaust all possible situations, since 
rank B cannot be less than k. 

As a matter of fact, matrix A, by hypothesis, generates 
a nonzero determinant of order k which isalso generated 
by matrix B. 

Example 1. The system 


r+ty=1, 
z—y=2 
has a determinant 
1 1 
Nl senile se 


and therefore has a unique solution which can be computed 
by the formulas 
x= AYA, y = A7/A, 


where 
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i.e. 


pea pee 
—2 7 as —2 ae 
Example 2. The system 
c+y=1, 
a+y=2 (©) 
has a determinant equal to zero. The matrix 
1 4 
A=l4 4 
has a rank A=1. And the matrix 
444 
a i 1 2 


has a rank B = 2. Since rank B > rank A, system (6) 
has no solution. However, this is seen without our theory: 
one and the same number cannot be simultaneously equal 
to 1 and to 2. 
Example 3. The system 

2x -+ 2y = 2, 

ox + 3y=3 
has a determinant A = 0. The matrix 
2 2 
3.3 
has a rank A=1. The augmented matrix 
2 2 2 
3.3 3 


also has a rank (B) equal to 1. Since rank A = rank B = 
= {, we take one equation 


2x + 2y = 2. (8) 
As is seen, the coefficient of y is not equal to zero, there- 
fore this equation can be solved with respect to y: 


y= == =1—x, (9) 


(7) 


ig 


— 
— 
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Formula (9) yields all solutions of equation (8). We can 
assign any value to c (— oc. <1 z < oo) and compute the 
corresponding value of y by formula (9), thus obtaining a 
system (vector) (x, y) satisfying equation (8). The set 
of all systems (z, 1— x), where x € (—0o, oo), forms 
the set of all solutions of equation (8). These solutions 
are automatically solutions of the second equation of 
system (7), since rank A = rank B. In this case the result 
is obvious without applying the theory of matrix ranks. 
The coefficients of equations (7) as well as the right-hand 
members of these equations are respectively proportional, 
therefore it is clear that any solution of one of these equa- 
tions is at the same time a solution of the other. 
Example 4. The system 


L+y+ z= 1, 
ax -+y+taz=1, 
xtyt+32=2 
has a determinant 
1-24: 
A=|2 1 2|=—240 
11 3 


and, therefore, has a unique solution which can be com- 
puted by the formulas 


1411 
At —1 _ 1 ms NE 
Cae ora ee Arce 
2 At 6 
1441 A: 4: A 
_-1 _ AS —-4 4 
123 44 2 
Example 5. The system 
zt+y+ 2=1, 
rt+y-2z2=1, 


x+-y+32=¢ 
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has a determinant 


The matrix 


hasarank A = 2, sinceA = 0, but there exists a nonzero 
second-order determinant generated by matrix A. For 
instance, 

| ee | 


1 8 ato 


The matrix 


1411 
B=|}1 1 2 14 
113 2 


has a rank B = 3, since the determinant generated by 
this matrix 

es | 

12 1\;=1540. 

13 2 


Since rank B >rank A, the system has no solution. 
Example 56, The system 


z+ y+ 2=4, 
x rrimtt (10) 
24+ 2y+42=2 | 


has a determinant 


- 
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It is easy to compute that the matrices 


i a 14144 
A=|[1 1 2], Bal[1 1 2 4 
22 4 2242 


have equal ranks, where rank A = rank B = 2. Let us 
choose from system (10) two equations so that the rank 
of the matrix A’ formed from the coefficients of these 


‘equations is equal to 2. In this case it is possible to take 
_ the first and second equations, or the first and the third. 


Thus, let us consider the system 


x+Y+ z=, 


xty+2z=1. “D) 


We transpose to the right-hand sides of these equations 
_ one of the unknowns so that the coefficients of the re- 


maining unknowns form a matrix A” whose rank A” = 2. 
Here we may transpose either z or y. 
Hence, the nonhomogeneous system 


w+ g=1—y, 

One aes (12) 
has a determinant 

1 41 
salt *laiaes 
therefore it has a unique solution for any right-hand side: 
1 {|t—y 1 4 [1 4—-y 

PAPAL cogs igure SEM Ape | ve 


Thus, for any y the sets of three numbers (4 — y, y, 0) 
yield all solutions of system (12) and, automatically, 
the solutions of the third equation of system (40) (this 
equation is obtained from the second by multiplying 
it by 2). 

Theorem 4. If system (1) is compatible, i.e. if it has at 
least one solution x, then it is necessary that rank B be 
equal to rank A. 
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It is easily seen that statement (1) (see p. 20) imme- 
diately follows from Theorem 4, since, if rank B > 
>>rank A, the necessary condition for compatibility 
(consistency) of system (4) is not fulfilled. 

Proof of Theorem 4. Let system (4) have a solution 
and rank A =k. We have to prove that rank B =k. 
Since, by hypothesis, rank A = k, there exists a nonzero 
determinant of order & generated by matrix A and, con- 
sequently, by matrix B as well. Therefore, rank B > k. 
Now we have only to prove that any determinant of 
order (« + 41) generated by matrix B is equal to zero. 
If such a determinant consists only of elements a,,;, 
then it is automatically zero, since it is also generated 
by matrix A that has a rank k, by hypothesis. Hence, we 
have to prove that any determinant of order (k + 1) 
generated by matrix B and containing a column formed 
by numbers y; is equal to zero. Without loss of generality, — 
we may hold that this is the determinant 


eo © &© @ %@ #+j(@& ee $@ @  @®  @  @ @ 


Anaya «++ Apsik Yries 


Any case can be reduced to this pattern by accordingly 
renumbering the equations and unknowns z;. By hypothe- 
sis, system (4) is compatible, i.e. there exists a vector 
X = (%, ..., £,) satisfying the equations of this system. 
But then, in particular, x satisfies the first k + 1 equa- 
tions of the newly renumbered system, Consequently, 


bose Head ta o Gee Gaceaeee  e (13) 


hy = ngy2rgy b---+Qintn —Y4, 


Maes = nea, ntiTrea tb +++ + Onss, nln — Yas 
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Let us set up a system with the unknowns 2, 29,... 
09 Sp+4- 
14124 tee OyeZ, tb Aza «== 9, 
Bois Ge ake ee ee, GR SS ete: ane wk ree Gee (15) 
On+4, 424 +++ + Ong, 2n + Ansa es = 0. 


On the basis of (13) and (14), this system is satisfied by 
the numbers x,, ..., 2,, 1 one of which, at all events, 
is not equal to zero. But then the determinant of homo- 
geneous system (15) is zero (see Theorem 3), i.e. 


yy ee Ate A a Taty ee Ann — Yt 
O — } «© 8 © «© «© 8» #© @© e@ «© @® #8 « 8 8 #8 © #8» ww» @ © #8 8© © @© 8 8 «@ 
Qhti,i+++ Grea,k Ones, ntitnsyt - ++ Onsy, nin — Yrs 
nr Qa eee Qap Qis 
= On al nbd ob os eo ee 
se=h+1 Gn+i, 4 Qnii,k Ak+i, s 
Q44 Aqp Y4 
Grt+i,i -++ Qnti,k Yae+i 
Gay ++ Aqp 4 
= S -—~f «© «©«©§ «© © © @© @ @ 6 ) ry e . ; (16) 


Oniya +++ Arsar Yrei 


since the determinants (of order 4 + 1) entering into 
the sum >) are equal to zero, as a matter of fact, the 


rank of matrix A is equal to k. 

Thus, we have proved that any determinant of order 
k + 1 generated by matrix B is equal to zero, which was 
to be proved. 

Now we are going to justify rule (2). Since the rank 
of matrix A is equal to & (rank A = k), system (1) con- 
tains & equations, the matrix of coefficients of which 
generates a nonzero determinant of order k. By renumber- 
ing the equations and unknowns, it is possible to achieve 
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that the first k equations of system (4) 
Ayty to. + Ant, = Yi, 
ache tact fate Ried ene (1') 
Apyly- --- + Apntn = Vp 
will have the determinant 
Qyy +s gp 
O=}-.--.--- (0, 
Qpi .-- Ann 
Renumbered system (4’) can be rewritten as follows: 
Ayyhy Op Ty = Yy— G4, nt pTap4 — +--+ — AUnZp, 


eo @¢ @€@ @ @ @ ®  @  @  @ @ @®  @  @®@  @ ©  @  @  @® oe  @ @ @ ee @®@ @ @®  @ 


Ayylyt ++ + Ann Le =Yn— Ap, ntilnti— ++» —ArnXn 
(17) 
Since the determinant o = 0, to any system of numbers 
Ln4+1. +--+, £, there corresponds a unique system of 
numbers z,, ..., Z, which, obviously, can be written 
in the following way: 
k 
eee 
“=o (Ys— Ms, n+sZht1— +--+ —AsnXn) Agy, 
s==1 
a | 
o 
th Fog, > (Ye— Gs, h4i%ng1— --+— Ann) Age, } 
e=x f 


(18) 


where A,; are cofactors of the elements a,; in the deter- 
minant o. Consequently, all the solutions of system (47) 
are written by formula (18). We may assign any values 
to the numbers x,4,, ..-, 2,, and the numbers z,, ... 

.., 2, will be computed by formulas (18). Hence we 
see that system (17) has an infinite set of solutions. 

We want to justify that any found solution x of the 
first k equations of system (1) is automatically a solu- 
tion of the remaining equations of this system, provided 


7 
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that rank A = rank B = k. For the sake of definiteness, 
let us prove that it is the solution of the (4 -+ 1)th equa- 
tion. Consider the first (4 + 1) equations of system (1) 
written in the form of (13). We have to prove that any 
solution x of the first k equations of (13) is automatically 
the solution of (4 + 1)th equation in (13). Let x be 
a vector satisfying the first k equations in (13). We set 
up equations (15) with respect to the unknowns 2, ... 
. «+, 2n4+4, Where the numbers A,, ..., An+, are com- 
puted by formulas (14) in terms of the components 
Lh+iy +++) L, Of the vector x. The determinant of 
system (15) is equal to zero. This is seen from equations 
(46) which should be read from right to left. By hypothe- 
sis, the determinant on the right is zero. But then system 
(15) has a nontrivial solution z,, 25, ..., 2,41. Here 
the number 2,4, 340 because, if we assume that sys- 
tem (15) has a solution of the form z,, ..., Z,, 0, then 
the numbers 2,, ..., 2, must vanish, since the deter- 
minant o = 0. But then z; =... = 2, = 2n4, = O and 
the system 2,, ..., 2,4, would be a trivial one. Since 
system (15) is a homogeneous one, not only the numbers 
Z1, +++) 2k+3, but also the numbers 


c / 
2, = 2/241, «+ +y 2 = Fp/2n41, 1 


possess the same property (2,4; 70). But then the 
numbers z,, ...-, Z, satisfy the system of the first & 
equations of (13), which has a nonzero determinant 
(o = 0). We already know that this system has solu- 


tions 2, ..., Z,, and, by virtue of uniqueness, 

ZB, = By, soy Th = Zp. 
Considering the last equation of (15), we see that it is 
satisfied. by the numbers; (a1, ..., 2x, 1), i.e. the 
numbers (%,, ..., 2,) satisfy the (k + 41)th equation of 
system (13) and, by virtue of (44), the vector under 
consideration x = (2,-..-., 2p, Let, +++, %) satisiies 


the (k + 1)th equation of system (1). This proves state- 
ment (2). 
Remark 1. Hereby we recommend the following method 


-for solving systems of linear equations, which is, as 


a matter of fact, the method of elimination of unknowns. 
3—-01496 
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Let there be given a system 
Ayly t+... + AynpSn = Oy, 
ee a ee ee re (4”) 
Anyly +. -- + 4nnty =b,. 


Multiplying some equation of this system by a constant 
number and adding it to another equation of system (1”), 
we obtain a new system which is equivalent to the given 
one. The new system of equations will have its own 
matrix B' apprepriately transformed from matrix B 
(B = B’). The transformation consists in that a certain 
row of matrix B is modified by adding to it another row 
multiplied by an appropriate number. Technically, 
instead of writing the new system of equations, we confine 
ourselves to writing only its corresponding matrix B’. 
Using properly the indicated operations on a system of 
equations, or, which is the same, on matrix B, it is 
always possible either to find the solution of the given 
system (1”), or to arrive to an explicitly contradictory 
system. Since the latter is equivalent to system (1”), 
this will prove the inconsistency of system (1”). 

Given below are several examples illustrating how 
this method is applied. 

The operation B = B’ means that B’ is obtained from B 
in one of the following ways: (a) by multiplying one row 
of B by a nonzero number, (b) by changing a certain row 
of B by adding to it another row multiplied by a certain 
number, (c) by applying operations (a) and (b) a finite 
number of times, (d) by interchanging rows. 

Example 7. Solve the system 


4+ 2r,+ 3x5-+ 4x, = 5, 

Lo + 223+ 32,= 1, 
x4 + 325+ 4a, = 2, 
r,+ 2+ 0%,+ 62, = 1. 


Of course, by Theorem 1, we could evaluate all the 
five determinants of the fourth order and find 2,, r., 13, x,. 
We would have to go through many repeating calcula- 
tions. 
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Form malrix B: 


1 
) 
4 
{ 


aw Ne & 
Oo we GC RS 


2, D 
1 4 
0 27’ 
1 1 
in which, as we see, the last column consists of right-hand 
members of the given system. Multiplying the first row 


by (—1) and adding it to the third and fourth rows, we 
get the matrix 


1 234 5 
fo 123 1 
B=1o0 200 —3 
Of a OD aad 


In this matrix the elements of the third row (excepting 
one) which are coefficients of the unknowns are equal 
to zero. Let us interchange the second and third rows. 
Then the nonzero element of the third row will be found 
on the principal diagonal: 


1 234 5 
oO 0 8 
“fo 4283 4 

C=1 oo =f 


In order to simplify the notation, we multiply the second 
row by —1: 


1234 5 
0 200 38 
B=1o0 423 4 
0-122 —4 
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Further transformations of the matrices are obvious: 


1034 2 
0200 38 
Bi> Bs=1 9 0 2 3 1/2 [8s 
0022 —5/2 
103 4 2 1034 2 
020 0 8 0200 3 
=lo02 383-1247" =10023 —112 |" 
000—1—2 0001 2 
1030 —6 1000 45/4 
0200 3 0200 8 
=lo020 —13/72 |72e=1 0020 —13/2 
0004 2 0004 2 


Hence, 2, = 2, #3 = —13/4, x, = 3/2, 2, = 15/4. In 
order not to commit an error, it iS recommended to 
effect a check by substituting the obtained values into 
the initial equations of the given system. 

Consider Example 5 from this point of view: 


Hy Le + L3= 1, 
Lyt4,+22,= 1, 
Ly + L_-b ot4a= 2 


{11414 11144 11444 
p-(1121)+2-(001 a} a.-[001 0) 
1132 0024 000 4. 


Hence, the initial system is equivalent to the follow- 


ing one: 
X,+ te+ 2,=—1, 
0-2, +0-2,.+ a 
O-2,+0-2,+0-2,= 14. 


The constant term of the last row is equal to unity, and 
the coefficients of the unknowns are zero, therefore the 
system is incompatible. 
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Finally, in Example 6 


1144 1414 
B=|1121)>B,=|001 0]>B, 
2042 0020 
eae 1114 1101 
—~10010 >Pi=(5 54 o)*%=(004 0) 
0000 


Hence, x, = 0, x, + x, = 1, i.e. the system has an 
infinite set of solutions: x, = 7,, #, =1—27, rz; = 0, 
where 2, is any number (—oo <( 2, < oo). 

Remark 2. And if we are interested only in the rank 
of matrix B, then the above indicated operations B => B’ 
should be extended not only to the rows, but also to the 
columns of the matrix. If in the process of these trans- 
formations there appears a row or a column consisting 
only of zeros, then they should be deleted and a matrix 
of a smaller dimension should be given further con- 
sideration. 

This method, known as Gauss method* is illustrated 
by the following examples. 

Example 8. Find the rank of the matrix 


12345 6 7 
O10140i1 41 
P=14212121 
00000 4 5 

It is clear that the rank of matrix #& is not greater 


than 4. In this case a,, = 10. Multiplying the first 


* Gauss, Carl Friedrich (1777-1855). German mathematician, 
usually considered along with Aristotle and Newton to be one 
of the three greatest mathematicians of all time. Made important 
contributions to algebra, analysis, geometry, number theory, 
numerical analysis, probability, and statistics, as well as astrono- 
my and physics. 
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row by (—1) and adding il to the third row, we get 
1 2 3 4 5) 6 7 
0 1 0 1 0 1 4 
00 —2 —2 —4 —4 —6 
0 0 0 0 0 4 o 


Multiplying now the first column by the corresponding 
numbers and adding it to the remaining columns, we 
obtain the matrix 
1 0 0 0 0 9) 0 
. 0 1 0 1 0 1 1 
2 NO: 10) ee: 0 ee eA eG 
0 0 0 0 0 4 i) 
The second column now consists of zeros, except for the 
element az, =10. Multiplying the second column 


by (—1) and adding it to the fourth, sixth, and seventh 
columns, we get 


5, = 


10 0 0 0 0 9 
01 0 0 0 0 
By = Ee 4 2 4 ~4 —6 | 
0 0 0 4 5 
10 yreer 10 000 
01 00000 04 000 
= ett 00 -200]7”% 
00 wees 00 045 
000 10 00 
- 000 01 00 
=VO0.=2 00) >" | 0020 
00 040 00 OA 


The fourth-order determinant of matrix B, is not 
equal to zero, consequently, 


rank B = rank B, = 4. 
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Example 9. Find the rank of the malrix 


1410 0 QO —1 —1 — 
4 0 0 0 1 0 
-(0 1 1 1j> B,=]0 14 
0 —1 —1 —1 0 0 


“(ft 0?) ee ee baie 
=(o 41 1) eB=lo 4 0 0) B= (0 1) 


i.e. the rank of matrix B equals two. 


1211 4 2 4 { 
0 
4 
0 
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Cartesian System of Coordinates * 


In this section we shall consider a real space. The 
concept of the vector in a real space is familiar to the 
reader from elementary geometry. 

The vector (in a real space) is defined as a directed 


—>- 
line-segment AB with initial point A and terminal 
point B that can be displaced parallel to itself. Thus, 


—> —_——> 
it is considered that two directed vectors AB and A,B, 
having equal lengths (| AB | = | A,B, |) and one and 


the same direction define one and the same vector a, 
—> —s 
and in this sense we write a= AB = A,B, (Fig. 2). 
—> —> 
The length | AB | of the vector AB is a nonnegative 
number equal to the length of the line-segment AB join- 


ae 
ing the points A and B. We shall also write | AB | = 
= | AB |. 

Vectors lying on parallel straight lines or on one and 
the same straight line are termed collinear. 


* Note that the present book first deals with the scalar pro- 
duct of two vectors, then with analytical geometry of the straight 
line and plane, and after this, in Secs. 11-13, the concepts of the 
vector and scalar triple products of vectors are set forth. If desired, 
these sections may immediately follow Sec. 6. 
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—> —> 

If the points A and B coincide, then AB = AA = 0 
is also regarded as a vector, the null vector. Its length 
is equal to zero ({0 | = 0), and direction has no sense 
for it. However, for the sake of generality of the rules 


BB: 


Fig. 2. 


of vector algebra, it is agreed that a pair of coincident 
points is to be regarded as a null vector. The latter is 
considered collinear with any vector. 

The projection of a point A on a straight line L (Fig. 3) 
is a point A’ at which the line LZ cuts the plane perpen- 
dicular to Z and passing through the point A. 

Let there be given a directed straight line L (Fig. 4) 

—> —+> 


and vector a = AB. The projection of the vector a = AB 


— 
on the directed line Z is the vector A’B’, where A’, B’ 
are the projections of the points A, B on L, respectively. 


=> 
With the given directed line L, the projections A‘B’ of 


—> 
any vectors AB lie in Z and are either in the same direc- 
tion as LZ, or in the opposite direction. This enables us 
to express the projections of vectors on a given directed 
line also by numbers and, hence, we can give another 
definition, this time an algebraic one. 


—s 
The projection of the vector a = AB on the directed 
Straight line L is defined as the product of the length of 


pees 
the vector a = AB by the cosine of the angle w between 
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Fig. 4. 


the vector a and.the direction of L: 
prra = |a|cos(a, Ll) =|alcosaoQO<co<can). (1) 


The transition from the first definition to the second 
is carried out in the following way: prz a is a number 


aay 
equal to the length | A’B’ | taken with the sign + or — 


—> 
depending on whether the vector A’B’ is in the same 
direction as LZ or in the opposite direction. 

In geometry, we consider addition and subtraction 
of vectors, as well as their multiplication by real numbers. 
By definition, the product aw = aa of a vector a by 
a number a or a number @ by a vector a is a vector whose 
length | aa| = |aJ]-|aJ], and the direction coincides 
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with that of a, if a > O or opposite to a if a <0. For 
a = 0 the length | wa | is zero and the vector aa turns 
into a null vector (point) having no direction. 

By definition, a finite number of vectors a, b, ¢, ... 
are added according to the rule for closing the “chain” 


a+bie+d 


Fig. 9. 
a 
b a 
a-b 
Cc b 
Fig. 6. Fig. 7. 


of these vectors. Figure 5 and 6 recall the way this is 
done. Figure 7 shows how vectors are subtracted. 

The projections of vectors a, b on a given direction L 
possess the following properties: 


pry, a + pry b = pry (a + b), (2) 
pry (wa) = & pry, a. (3) 
Property (2) is illustrated in Fig. 8: 
-——>  +--—> --> 
pry atpry, b= AR’ 4 BIC’ =A'C' =pry c= pry (a+b). 
Since a= (a — b) + b (see Fig. 7), we have 


pry, (a — b) + pr; b = pry, a, 
and, consequently, 
pry a — pry, b = pry (a — b). (2°) 
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Let us prove property (3). Taking the angle between the 
vector a and Lhe direction L equal to m, we have 


for a>0: _ pry (aa) = | aa |cosw =a ilal|coso = 
= &% Pry a, 
for « <0: pry, (wa) = | aa | cos (n — «) 


= —a|a|cos(xn — wo) =ala|cosw =a pry a. 


As is known, for « < 0 the vector qa is in the direction 


Fig. 8. 


opposite to that of a, and if a forms an angle » with L, 
then aa is at an angle nm — q to L. 

For a = 0 both sides of (3) vanish. 

A scalar product of two vectors a and b is defined as 
a number (a, b) equal to the product of the lengths of 
these vectors multiplied by the cosine of the angle w 
between them: 


ab = (a, b) = |a | |b | cos (a, b) = [| a[|b|cosw. (4) 


Obviously, we may also say that the scalar product of 
vectors a and h is the product of the length of the vector |b | 
by the projection of the vector a on the direction b or the 
product of the length | a | by the projection b on the direc- 
tion a: 


ab = (a,b) = |al|preb= |b prya. 
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The scalar product possesses the following properties: 


(a, b) nae (b, a), (5) 
(a, b + ce) = (a, b) + (a, c), (6) 
(a, ab) = a (a, b). (7) 


Equation (5) follows directly from the definition of 
the scalar product. 
Equation (6) is proved as follows: 


(a, b + ¢) = |a | pra (b + €) 
= |al|prab+ |a| prac = (a, b) + (a, e). 
And this is how equation (7) is proved: 
(a, ab) = | a | pra (ab) = | a |o@ pra b = az (a, b). 


Taking into consideration (5), it follows from (6) 
and (7): 


(a + b, c) = (a, c) + (b, oc), (6’) 
(aa, c) = a (a, b). (7°) 


Now we pass over to an.analytical description of vec- 
tors and points in space, i.e. to describing them with 
the aid of numbers. Let us introduce in space a rectangu- 
lar coordinate system x, y, 2, i.e. three mutually per- 
pendicular directed lines passing through a certain point O 
called coordinate axes x, y, z (Fig. 9). It is assumed 
that for this coordinate system a unit line-segment is 
chosen by means of which all other segments are measured. 
The point O is called the origin (of coordinates). 

Let us specify an arbitrary point A of a three-dimen- 


— 
sional space. The directed line-segment OA is called the 
radius vector of the point A. The racius vector, in its 


ae 
turn, defines the vector a (a = OA), which can be trans- 
lated in space. We denote the projections. of the radius 
vector a on the axes z, y, 2 by 2, y, 2, respectively. These 
are coordinates of the point A, which are called the 


abscissa (x), ordinate (y), and z-coordinate of the point A. 
—_> 


Between points A in space and their radii vectors OA 
or, which is the same, triples of numbers (z, y, z) which 


ind 
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— 
are the coordinates of the point A or the projections of OA 
on the coordinate axes there is one-to-one correspondence, 
That is why there will be no confusion if we call the 


Fig. 9. 


triple of numbers (z, y, z) either a point A having these 
numbers as its coordinates, or a vector a having these 
numbers as its projections. 

From the definition of the vector as a directed line- 
segment which can be translated in space, it follows that 
two vectors a = (2;, y,, 2;) and b = (2g, Yo, Z_) are equal 
if, and only if, the equalities 7, = 2, yy = Ye, 241 = 2 
are fulfilled simultaneously. The following equations 
hold true: 


ye zvtk@ iy, 27) =—@te,yty,2z+2'), 8) 
a (x, y, 2) = (ax, ay, a2). (9) 


Equation (8) follows from that the projection of a sum 
or a difference of vectors (on one of the coordinate axes) 
is equal to the sum or difference of the projections of the 
terms. 

Equation (9) follows from that the projection of the 
vector aa (on one of the coordinate axes) is equal to the 
product of a by the projection of a. 

Let us denote .by i, j, k unit vectors (of length equal 
to 1) having the same direction as the axes 2, y, 2, 
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respectively. The vectors i, j, k are called basis vectors 
of the axes x, y, 2. An arbitrary vector (z, y, 2) can be 
written in the form 


(x, y, 2) = zi + yj + 2k. (10) 
Indeed, 
i= (1, 0, 0), j=(0, 1, 0), k= (0, 0, 4). 
Therefore, by (8) and (9), 
xi + yj + 2k = z (1, 0,0) + y (0, 1, 0) + 2 (0, 0, 4) 
= (z, 0,0) + (0, y, 0) + (0, 0, 2) = (@, y, 2). 


Let us note the equations relating the scalar products 
of the unit vectors of the coordinate axes: 


ii = jj =—kk=1, ij = ik = jk = 0. 
Let now a = (z, y, 2), b = (2, y’, 2’). Then 
ab = (a, b) = 22" + yy’ + 22’. (14) 
Indeed, by virtue of (6), (7), (6’), (7’), 
ab = (xi + yj + 2k) (i + yj + 2k) 
= anit + xy"ij + x2'ik + yx’ji + yy'Si + y2'k 
+ ze’ki + zy’kj + 22’kk = xr’ + yy’ + 22’ 
In particular, making b = a in this formula, we get 
|a |? = aa = a? + y? + 2?, 
whence the length of the vector a is equal to 
jaj= +V e+ y? +2, 
Hence, the distance between the points a = (z, y, 2) and 
b = (z’, y’, 2’) is equal to (Fig. 10) 
|AB| = |b—a] = + V (2’— 2 + (yy F (2. 


For further treatment of the material it is important 
to sum up what has been said. To this effect, let us in+ 
troduce somewhat different designation of coordinates. 
Namely, we are going to introduce in space a rectangular 
system of coordinates 2,, x,, x3. Then every point of 
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space will be represented by a lriple of numbers 
x = (21, Lo, 3). 


We shall denote this point by a bold-face letter x and 
also call it a vector x with components 2, 2, 23. 

We have proved that addition and subtraction of vec- 
tors, as well as their multiplication by numbers, are 


Fig. 10. 
expressed in terms of triples (2, x, 23) in the foilowing 
way: 
(Xq, Za, £3) + (Ly, Ly, Zs) 
=(% t2,, tg 2), 23+ 25), (12) . 


O& (L4, Ta, Lz) = (ATy, HXy, OL). 


The scalar product of the vectors x = (z,, x», 43) and 


x’ = (xj, x, £,) is expressed in terms of the coordinates 
of the vectors x and x’ by the formula 


xx! = (x, x’) = 22, + rom, + 2373. (13) 


The length of the vector x = (x, Xj, 23) is a nonnegative 
number equal to 


[Ix] =V ait+ai4 zh. (14) 


Finally, the distance between the pointsx = (2,, og, Xs) 
and x’ == (|, ©, £3) is equal to 


[x—x'J= V (xi —x,)? + (2, — 22)? + (4) —25)®. (15) 


Real space whose geometry was studied here is called 
a three-dimensional space, since its points are naturally 
represented by triples of real numbers. We shall denote 
it by Rs. 
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It is quite natural to denote an arbitrary plane by Rg. 
In R, we can specify a rectangular coordinate system z,, 
x, With the aid of which any point of R, or its radius 
vector can be represented by a pair of numbers (2), x2). 
It is obvious that for vectors belonging to a plane the 
operations of addition and subtraction, as well as of 
multiplication by a number, obey conditions ,(12),§ pro- 
vided that the third components are deleted everywhere 
in the parentheses. The scalar product of vectors belong- 
ing to our plane is also expressed by formula (13) where 
the third term of the right-hand member should be delet- 
ed. The same should be done with formulas (14) aud (15). 

The space A,, where n is an arbitrary natural number, 
is a generalization of the spaces R, and R3. For n> 3 
the space R, is a mathematical fiction, or rather a great 
invention. lt helps to mathematically understand complex 
phenomena. 


Problems 


i. Find the lengths of the vectors (1, 1, 1), (4, —1, 4), and 
1, 2, 3). Ql 
2. Find the angle between the vectors (1, 0, 1) and (4, 2, 2). 

3. Given a unit cube (with the edge of length equal to 1). Find 
the angle between diagonals emanating from one of its vertices: 
(a) the principal diagonal and diagonal of a face; (b) two diago- 
nals of two faces. 


Sec. 6, n-Dimensional Euclidean Space. 
Scalar Product of Two Vectors 


The set of all possible systems (z,, ..., z,) of real 
(complex) numbers is called the n-dimensional real 
(complex) space and is denoted by A,. Each system will 
be denoted by one (bold-face) letter without a subscript: 


K = (2, .. «y Ln) 


and called a point or vector R,,. The numbers z,, ..., 2p 
are termed the coordinates of the point (vector) x or the 
components of the vector x. 

Two points 


KX = (4%, --.-, BH), xX = (%j, .- > Zp) 
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are regarded as equal if their corresponding coordinates 
are equal 


=o; (a1: eyes OD). 


In other cases x and x’ are different (x ; x’). 
The systems (vectors) x = (4%, ..., Za), Y= (Yn - 

-» Yn) can be added and subtracted, and aslo multiplied 
by the numbers a, §, ... which are real if AR, is a real 
space, and complex if R, is a complex space*. 

The sum of vectors x and y is defined as the vector 


x at y = (24 am Ya, Le si Yo oe 8 89 In a Un) (1) 
and their difference as the vector 
X— y = (1% — Wy, + +> Lp — Yn). (2) 


The product of the number a by the vector x, or the 
vector x by the number a, is defined as the vector 


CX XG = (OF. 4 4 ay OTe): 
Obviously, the following properties are fulfilled: 
(1) x+y=y+x, 
(2) x+y)+2=x+(y +2), 
(3) x —y =x-+ (—1)y, 
(4) ax + ay = a(x + y), 
(5) ox + Bx = (a + B)x, 
(6) a (Bx) = (af) x, 
(7) 1-x =x, 


where a, § are numbers and x, y € R,. 
The number (nonnegative) 


x] = V dae ja |? = -V daa (3) 


is called the length or norm of the vector x = (z,,.. ., 2,)- 


* For complex numbers see our book “Differential and Integral 
Calculus” (Sec. 5.3), Mir Publishers, 1982 (in English). 


4—01496 
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The distance between two points x = (a, ..., %,) and 
Y = (Yi, - ++» Yn) is determined by the formula 
Tr 
Ix—y|= 2 [zx — Yr]. (4) 
The scalar product of a vector x = (%, ..., Ln) by 
a vector y = (Y1, - +--+) Yn) is the number 
% 
x, y)= 2b ays, (5) 


where 7¥; is a complex number conjugate to the number y,*. 
The scalar product possesses the following properties: 
(a) (x, x) 2 0; it is equal to zero if and only if x = 

= (Oe Oy: 5. ag. OY, 
(b) (x, y) = (y, x), 
(c) (ax + By, 2) = a (x, 2) + B (y, 2). 


Indeed, 
(x, x)= > xjx;—= Dd} |xj|?>0, 
j= 
where equality may be the case if 4, =... =z, = 0. 
Further, 


(x, y)= (3 ain) £1y;) = pa LY j= a yjtj(Y> X)- 


Here we have taken advantage of the operation of con- 
jJugation z+ ca =z+t2, and zz, = 2-2,. Finally, 


(ax-+By, z)= > (cory + By;) 2; 


=a S272) +8 Dua alm 2) +BY, 2) 

j= j= 
We shall confine ourselves to the consideration of 
a real n-dimensional space. In it vectors are multiplied 


* The space R, into which a scalar product is introduced is 
called Euclidean. 
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by real numbers. The term “real” will sometimes be 
omitted. For a real space the modulus sign (of numbers) 
in formulas (3) and (4) may be substituted by parentheses: 


ni-V 3 3 


Ixy] = V3 (%y— Uy) (4) 


and the scalar product has the form 


(x, y= 2 iY js (5') 


since in this case y¥; = y;, and then (x, y) = (y, x), i.e. 
in a real space the scalar product is commutative. 

From the properties (a), (b), (c) there follows an im- 
portant inequality (called Buniakovski's inequality*) 


I(x, YI<V (x, YV(y, y)- (6) 


Indeed, for any real number A 


O< (x + Ay, x + Ay) 
= (x, x) + A(y, X) + 4 (x, y) + dM (y, y) 
= (x, x) +2 (x, y)A + (y, y) = = a + 200 + cd’, 
where a = (x, x), b = (x, y), ¢ = (y, y). We see that the 
quadratic polynomial 
ai-+2bA + cv S00 (—0o<A < oo) 


is nonnegative for any real \. Therefore its entire graph 
lies above the axis i which is possible only if the discri- 
minant of the polynomial is negative or zero, i.e. if 
b*? — ac <0 or b*? < ac, and we have obtained Bunia- 
kovski’s inequality (6). 

In terms of the components of the vectors x and y 
inequality (6) can be written as follows: 


nr nm re) 
INeml<YP 24 / > yj. (7) 
j=1 j=1 j=1 


* Buniakovski (or Bouniakowsky), Victor Jakowlevitsch 
(1804-1889). Russian mathematician. 


4 
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Thus, whatever the real numbers 2z;, y; are, inequali- 
ty (7) is fulalled. 

By virtue of (3), Buniakovski’s inequality can he 
written in the following manner: 


ly IS Ixttyl. 


But then there exists a unique number jd satisfying the 
inequalities —1 < 4 < 1, for which the rigorous equality 


(x,y) =Al[xllyl 


holds true. 

Note that on [0, x] the function cost has a single- 
valued strictly decreasing inverse function with the do- 
main of values on [—1, 4]. Therefore, for every A (—1 < 
<< 1) there exists a unique angle o (0X o® <7) 
such that A = cos w. Hence, we have proved the equation 


(x,y) = |x| ly | cosa. (3) 


The number o is called the angle between n-dimensional 
vectors x and y, although, in fact, for n > 3 the vectors x 
and y are not real line-segments at all, they are mathe- 
matical abstractions. 

The vectors x and y are called orthogonal if their scalar 
product is equal to zero: 


(x, y) on Zz LY p= 0. 


It follows from (8) that for nonzero vectors x and y to be 
orthogonal, it is necessary and sufficient that the angle w 
between them be n/2. 

We should like to mention here an important in- 
equality (called Minkowski's inequality*) 


ixtyl|<|ix|+lyl, (9) 


* Minkowski, Hermann (1864-1909). Number theorist, alge- 
braist, analyst, and geometer. Born in Russia, lived in Switzerland 
and Germany. Created the geometry of numbers. Provided the 
four-dimensional geometric space-time framework for relativity 
theory. 
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or in terms of components 


V Ste+u<V Sar Bu. do 
j=1 j=1 j=1 


It can be proved in the following way: 
Ixt+yP=(€+y, xt+y)=(@x) +2, y)+fy, y). 
Using Buniakovski’s inequality (6), we have 
IxtyP<(x, x+2V(x, x) V(y, y+, y) 
=(V(x, x) +V(y, y))?, 


whence follows (9). From (9) there follows the inequality 


iixl|—lyll<Ix—yl, (11) 
since 
(es eS yer ye | ee ay hs 
lyl=l|y—x+x/<[|x—yl[+ |x|. 


Problem. Find the angle between the vectors (1, 0, 
1, 0) and (4, 1, 4, 1). 


Sec. 7. Segment of a Line. Dividing 
a Segment in a Given Ratio 


Let us specify arbitrary points x, y € A, and introduce 
the set of points (vectors) 


z=Ax+tyuy (4,420, A+ypu= tf), (1) 


defined by nonnegative numbers 4, u whose sum is equal 
to 1. We have 


z=(1—p)x+py=xtupiy—x) (Ox<pxt) (2) 
or 

z=y+h(x—y) (0<A<1). (2) 
It is seen from equation (2) that in a three-dimensional 
space points z fill the line segment connecting x and y. 


The radius vector z is the sum of the vector x and the 
vector w (y — x) which is collinear with y — x (Fig. 11). 
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Hence, the set of points (1) represents a line segment 
[x, y] in R, joining the points x and y. For »p = 0z = x, 
for’ = Oz =y, foranyA >0 (up =1—A>0)z is an 
arbitrary point on [x, y]. 

The line segment [ x, y ], joining the poinis x, y € Ry, 
is defined as the set of all points z of form (1). The follow- 
ing theorem holds true. 

Theorem 1. The point 


z=i)xtuy (A, uw>0, A+ py =M) 
divides the segment [x, y] joining the points x, y € R, into 


Y(¥-x) 


Fig. 11. 


two segments whose lengths are in the ratio up: d. 

Proof. It follows from (2) that z — x = uw (y — x), and 
therefore the distance between the points x and z is 
equal to 

|z—x|=ply—x|. (3) 


Further from (2"): z — y = A (x — y), and therefore the 
distance between the points z and y is equal to 


joy enix yi. (4) 
It follows trom (8) and (4): 
lz—x]:l[z—yl=u:,, 


which was to be proved. 

Problem. On the segment [x, y], joining the points x, 
y € R,, find the point z that divides this segment in the 
ratio p:q¢(p>O0, g> 0). 
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Solution. Let us take the numbers 


ee! a 
ptqg’? ' p¥q° 
They satisfy the properties of A, p>O0, A+ p 
u/i. = p/q. Therefore, by Theorem 4, the oauemd: a 


= __ _9&-+ Py 
z=Ax-+ py ae (5) 
Its coordinates z = (2,,..., Z,) are expressed in terms of 
the coordinates x = (2,, ..., Yn), Y= (Vy, -- +) Yn) Dy 
the equation 
Qrj- PY; 
_— =e Pee eee o” 
4; pt (j 1, n) ( ) 


In particular, the middle of the segment is obtained for 
p=q=t,ie. A=p=1/2 

Note, that, asit is proved in mechanics, the point z de- 
termined by (5) or (5’) is the centre of gravity of the system 
of points x and y at which masses q and p are concentrated, 
respectively. 

Note that in R, the set of points 


z=aAxtuy, A+t+u= 


where A and uw are of either sign, represents a straight line 
passing through the points x and y. It is obvious from (2’). 

And in space R,, (n > 3) this set is called a straight 
line by definition. 

Example 1. Find coordinates of the ane of gravity of 
a system of material points x* = (x, zt, xt) with cor- 
responding masses p, (k= 14, ..., M). Using formu- 
la (5’) for the points x', x* we first find the centre of gravi- 
ty zi of the points x! and x" and then the centre of grav- 
ity z? of the points z' and x® with corresponding masses 
p,; -+ p. and py. Proceeding in the same manner, we obtain 
for the (IM — 1)st step 


M 
M-1 1 h : 
er ice rage 
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Sec. 8. The Straight Line 


The concept of the straight line is primary in geometry. 
From the axioms of geometry we know that a straight line 
is determined by two points, i.e. through two given points 
we can draw only one straight line. Through a point lying 


Fig. 42. 


on a given straight line we can draw only one straight 
line perpendicular to the given line. 

In the plane, let us specify a rectangular coordinate 
system (x, y) and a straight line L not parallel to the 
y-axis (Fig. 12). We know from school that the equation 
of the straight line Z has the form 


y = ke +1, (1) 


where k = tan @ and a is the angle formed by the line Z 
with the positive direction of the z-axis, and J is the 
ordinate of the point of intersection of Z with the y-axis 
(2 = OB). 

When we say that equation (1) is the equation of the 
straight line 2, we mean by this that Lis a locus of points 
whose coordinates (x, y) satisfy equation (1). The valid- 
ity of this assertion is obvious from Fig. 12. Point A is 
an arbitrary (moving) point belonging to the line L; its 
coordinates (z, y) are: BC =z, AC =y — 1, and 


k=tana= y= , (1’) 
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whence (1) follows. Conversely, equation (4) is equivalent 
to equation (1°), and the latter, obviously, expresses the 
fact that the point (x, y) lies on the straight line Z. In 
Fig. 12 the angle a is acute. In the case of an obtuse a we 
reason in a similar way. 

Consider the equation 


Az + By +C =0, (2) 
where A, B, C are specified numbers, A and PB being not 


both zero. 
If B ~0O, then equation (2) can be written in the form 


A C ; 
es (2’) 
or, setting 
A Cc 
k= — =, l=——=, 


in form (1). Since equations (2) and (2°) are equivalent to 
each other, i.e. any point (z, y), satisfying one of them, 
satishes the other, (2) for B ~ 0 
is the equation of a straight line 
inclined to the positive direc- 
tion of the z-axis at an angle 
a& whose tangent equals —A/B 
(tan @ = —A/B), and intersecting 
the y-axis at the point having 
the ordinate —C/B (1 = — C/B). 
For B =O equation (2) takes 
the form 


Ar+C=0 (A <0!), 


or 
z=a (a= —C/A),. 


This is also an equation of 
a straight line which is parallel 
to the y-axis. Namely, this is a locus of points (z, y) 
the abscissas z of which are equal to one and the same 
number a. Figure 13 shows such a line for a > 0. 

lt follows from the aforesaid that (2), where A, B, C 
are given numbers, A and B are not both zero, is the 


Fig. 13. 
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equation of some straight line. For B = 0 this line is not 
parallel to the y-axis. In particular, for A =O it is 
parallel to the z-axis (y = —C/B). If B = 0, then it is 
parallel to the y-axis. Note that y = 0 is the equation of 
the z-axis, and x = 0 is the equation of the y-axis. 

Equation (2) is called the general equation of the straight 
line. Any straight line situated arbitrarily with respect to 
the coordinate system can be described by the equation 
of form (2) for suitable constant numbers A, B, C. We 
underline once again that the numbers A and B in equa- 
tion (2) are not both zero. Note that the number ¢& in equa- 
tion (1) is called the slope of a straight line. 

Let us solve several important problems. 

Problem 1. Write the equation of a straight line, whose 
slope is equal to the number 4, passing through a given 
point (Xo, Yo). 

Solution. A straight line with the slope k has the form 

y=hkr +], (3) 
where / may be any number. Since the point (xy, yo) must 
lie on the given line, its coordinates must satisfy the 
equation 


Yo = katy + 1. (4) 
Subtracting (4) from (3) we get the required equation 
Y — Yo = k (€& — 2) (5) 


of the straight line passing through the point (zo, yo) and 
having the slope k 

Problem 2. Write the equation of a straight line passing 
through two given points (z,, y,) and (#., yz). It is as- 
sumed that the points are different. 

Solution. Let z, + z,. Then, obviously, the sought-for 
line ts not parallel to the y-axis, and therefore may be 
written in the form 


y— yy =k (« — aj), (8) 
where & is a certain number. Equation (6) itself expresses 
the fact that the straight line passes through the point 


(z,, y,). For this line to pass also through the point (7., ye), 
it is necessary that the equation 


Yo — YW = & (Lq — 2) (7) 
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be fulfilled. Dividing (6) by (7) (that is, dividing the left- 
hand member of (6) by that of (7), and the right-hand 
member of (6) by that of (7)), we get 


Y— Yi Be: t— £1 (8) 


YoY L_g— ty 


This is just the equation of the straight line passing 
(hrough the points (z,, y,) and (%»q, Yo). 

Remark. Jt might happen that y, — y, = 0, then, 
formally, we would obtain the equation 


Ue) a 


05 ee me 
In spite of the fact that this equation is senseless, for the 
sake of convenience, it is customarily written in such 
a way. Getting rid of the denominators, we obtain a true 
equality 


y—Yy,= 0-1 =0 


Ly — 2 
or 
y = Yr (9) 
The case z, = z, = c leads to the solution rz = c. 
Problem 3. Find the angle w between the straight lines 


y=khaet+h, y=kort ly. 

Solution. We have k, = tan @,, kz = tan G2, where ay, 
a, are the angles formed by the given lines with the posi- 
tive direction of the z-axis, respectively. We have (Fig. 14) 

tana,—tana, — ky—kp (10) 

4ttana,tana, 141-+kyk,’ 
which is the formula for the angle between two straight lines. 

The case 1 + k,k, = 0 or 


lyk, = —1 (14) 


expresses the condition for perpendicularity of straight 
lines. ‘Vhe condition for parallelism of straight lines 
(tan » = 0) will be written as follows: 


hy = ee (12) 


tan @ = tan (a@,— @_.) = 
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Let us write equation (2) in the following form: 
Aya, + Agr. + C = 0, (2°) 


where we have set t= 2%, y= «,, A =A), B=Az. 
For A, «0, A. #0, C ~ 0 equation (2’) can be written 
in the form 


Ly Lo ae 5 = C 

Srp eHi, a= 7 aaa oe ; (13) 
Equation (13) is the intercept form of the equation of a 
straight line. This line intersects the axis 2, (the 


Fig. 14. 


straight line x, = 0)at the point (a,0) and the axis zx, at 
the point (0, 6). 

If a straight line, satisfying equation (2'), passes 
through the point (z{, 2), then 


Subtracting (14) from (2’), we get 
A, (21 — 21) + Ag (fg — 24) = O. (15) 


Equation (15) is the equation of a straight line passing 
through a point (2), Z}). 

Introducing into consideration the vectors A = (A), Az), 
x = (x, 2_), x° = (a, x2), we may regard the left-hand 
member of (15) as a scalar product of the vector A by the 
vector x — x°. Therefore equation (15) can be written 
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Fig. 16. 


in the vector form as 
A (x — x°) = 0. (15’) 
The vector x — x° belongs to the line L (Fig. 15). Hence, 
it is seen from (15’) that the vector A = (A, Ag) Is 
orthogonal (perpendicular) to the given line. Thus, we 
have elucidated the geometrical meaning of the coeffi- 
cients A, and Ag. 
Consider two lines 


At, + Ao®e + C1, =O (Ly), (16) 
Byt, + Bot, + Cy =O (Lp). (17) 
Since the vectors A = (A,, A.) and B = (A,, B,) are per- 
pendicular to lines (16) and (17), respectively, the angle @ 


between the straight lines (16) and (17) is equal to the 
angle between the vectors A and B (Fig. 16). The angle @ 
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can be computed by the formula 


_ (A,B) _ A,B, + A,B» 
S°= TABI Vara vata | O 
Remark. If @ is an angle between two straight lines, 
then x — @ is also an angle between these lines. The 
number (18) can be either positive, or negative. The former 
corresponds to the angle , the latter to x — q. 
From (18) we obtain the condition for the perpendiculari- 
ty of Z, and L, (p = n/2, cos p = 0): 


A,B, + A,B, = 0. (19) 


If the lines ZL, and ZL, are parallel, then the vectors A 
and B are collinear and A = AB, where 4 is some real 


Fig. 17. 


number. Hence the condition for parallelism of straight 
lines is expressed by the equation 
A, eas A, 
Br Ba (20) 
Let there be given an arbitrary straight line Z in a rec- 
tangular coordinate system (Fig. 17) not passing through 
the origin, and let a be a vector emanating from the origin 
and perpendicular to the line £ with the foot lying on 
this line. The vector a completely determines the line ZL, 
since through its foot there passes the only straight line 
perpendicular to it. Let p be the length of the vector 
a = (p=ja |); v = (cos ay, cos @,)is a unit vector in the 
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same direction as a. Here a; is the angle between a (or v) 
and the posilive direction of the axis z; (i = 1, 2); 
cos* a, + cos?a, = 1 (cosa, = sin a,). Let us denote 
by x =: (x, Z,) an arbitrary (moving) point of the line 
L, or, which is the same, its radius vector. The pro- 
jection of the vector x on the unit vector v is, obviously, 
equal to p, i.e. the scalar product of the radius vector of 
an arbitrary point x on the line LZ by the vector v is 
equal to p: 


(x, v) = |v | prex = p. (21) 


Thus, we have obtained a vector equation of ZL, since, 
conversely, if a point (radius vector) x satishes equa- 
tion (21), then it lies on Z (a point not lying on ZL has 
a projection of v which is different from p). 

If LZ passes through the origin, then its equation can also 
be written in the form of (21), where v is a unit vector 
perpendicular to it. 

In the coordinate form, equation (21) is written as 
follows: 

2, COS, + 4%, cosa, =p (p>0) (21°) 
or 
Z,cosa@, + 2#,sina,=p (p> 9). (21”) 


This is the normal form of the equation of a straight line. 
If a straight line Lis given by the general equation (2') 
Aj2, + A oX> + C = QO, 


then it can be reduced to the normal form by multiplying 
it by the number 


M=+1/V42+24, (22) 


where the sign should be opposite to the sign of C (p = 
= —MC>0). The number M is called the normalizing 
factor. Since 


(MA,)* + (MA;)* = 4, 
there exists the only angle a, satisfying the inequality 
O< a, <( 2n, for which 
MA, = cosa,, MA, = sin ay. (23) 
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Asa result, we obtain equation (21), where p = — MC> 
> 0. Note once again that the number p is equal to the 
distance from the origin to the straight line. 

Problem. Find the distance d from the point x° = (2%, 
z,) to the line Z defined by the equation 


Aaa, +- AgX5 +. C =— Q. (24) 

Solution. Let 
(x, v) —p =0 (29) 
be a normal equation of line (24). Thus, if C 40, then 


— 
Pp (p > 9) is the length of the vector OA dropped from 
the origin O on Z and v is a unit vector which is in the 


—> —> —> 
same direction as OA (p=|OA|, v=OA/p, see 


Fig. 418 


Fig. 18). Let x be an arbitrary point on L. Then, obvious- 
ly, in order to find the distance from the point x° to L, 
we have to project the vector x — x® on the direction of 
the vector v and take the absolute value of the projection 
thus obtained: 

d= | pry (x — x’) | = | — x’, v) | 

= | (x, v) — (x°, v) | = [Pp — (&®*, v) | 

= | (x°, v) — p |. 
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We have obtained the formula 
d = | (x°, v) — pl. (26) 


Thus, in order to get the distance d, we have to reduce 
equation (24) to the normal form, transpose p to its left- 


Fig. 19 


hand side, replace 2,, z, by the corresponding coordina+ 
tes x}, x, of the point x°, and take the absolute value of 
the expression thus obtained. 


This is how formula (26) is written in terms of the 
coefficients A,, A», C: 


a= | Aya? + Agag+C | 
V AR+ AR 
For C = 0 formulas (26)-(26’) also hold true. In this 


case p = OQ, vis one of the two unit vectors perpendicular 
to L (Fig. 19). Now 


d = | pry (k — x") |= |(v, x — x’) | 


= | (v, x) — (v, x") = | (v, x) | 


(26°) 


or 
qa Arti Age | 
V AFAR 
i.e. (26’) for C = 0. 
Remark. It is obvious from Fig. 18 that if the origin 
and the point x° are found (a) on one side of L, then the 
5—01496 
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angle between v and x —x°® is acute, and d = p — 
— (x°, v), (b) on different sides of ZL, then the angle be- 
tween x — x? and v is an obtuse angle and d = (x°, v) — p. 


Sec. 9. The Equation of a Plane 


In space Rs, with a rectangular coordinate system (z,, 
Zz, %3) introduced, let us specify a vector a emanating 
from the origin O. Through its terminus a we passa plane 
L perpendicular to a (Fig. 20) and denote an arbitrary 
(moving) point of the plane L by x = (a, Zo, 23). 
The letter x also denotes the radius vector of the point x. 

Let p = |a] be the length of the vector a, and 


Vv = (COS Gq, COS Gy, COS M3) 


be a unit vector which is in the same direction as a. Here, 
4, He, G3 are angles made by the vector v with the positive 


Fig. 20 


directions of the axes 71, £y, X3, respectively. The projection 
of any point x€ L on the vector vis, obviously, a con- 
stant quantity equal to p: 


(x, v) =p (p> 0). (4) 
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Equation (1) has sense for p = 0 as well. In this case 
plane Z passes through the origin O (a = 0), v being 
a unit vector drawn from O perpendicular to Z in either 
direction, i.e. the vectorvis defined up to a sign. Equation 
(1) is the equation of plane L ina vector form. In terms of 
coordinates it is written as 


Ly COS A, + Ly COSA, + F3C0SG, =p (p> dQ) (1’) 


and is called the normal form of the equation of a plane. 
Multiplying equation (1’) by a nonzero number, we get 
apn equivalent equation of the form 


Aya, + Apt, + Agzys + B =O, (2) 


defining the same plane. Here the numbers A,, Az, Az are 
not all zero. Equation (2), where the numbers A,, Ag, Ags 
are not all zero, is called the general equation of a plane. 

An arbitrary equation of form (2), where the numbers 
A,;, Ag, Ag are not all zero, can be reduced to the normal 
form by multiplying it by the number 


M=+1/V A?+A?+ AP, 


where the sign is taken opposite to that of the number B. 
Then the number p = —MB _ will be nonnegative and 
equation (2) is transformed into the following equivalent 
equation: 


MA,z, + MA,z, + MA;2, = p. (3) 
Here 

(MA,)* + (MA,)? + (MA;)? = 1. 
This shows that the vector 


v = (MA,, MA,, MAs3) 


is a unit one (| v | = 1). Its projections on the coordinate 
axes are equal to 


MA, = COS Qy1,5 
MA, = Cos Qe, 


MAs; = Cos a3. 
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where @,, &), &; are angles formed by the vector v with 
the positive directions of the axes 7,, %_, Zs, respectively. 
By virtue of the introduced notation, equation (3) has 
the form 


X1 COS A +L, COS +73 COS Mz = P (p > 0), (3’) 


i.e. we have obtained the normal form of the equation of 
a plane. 

If we are given general equation of a plane (2) and it is 
required to find out its position relative to the coordinate 
system, then it is sufficient to reduce equation (2) to the 
normal form by multiplying it by the normalizing factor J. 

From equation (2) ilself (without carrying out any 
computations) we can only state the following two facts: 
(1) if B = 0, then the plane passes through the origin, 
and if B 0, then it does not pass through the origin; 
(2) the vector A = (A,, Ag, Az) is perpendicular to the 
plane, since it is collinear with the unit vector v = (MA,, 
MA,, MA;), perpendicular to the given plane. 

The equation 


Aik) a A Xo +. B — Q) (4) 


is a particular case of equation (2). In the plane (4, z,) 
equation (4) defines a straight line, and in space (21, 2%, 
£3) it is the equation of the plane LZ perpendicular to the 
coordinate plane (x,, x.) and passing through this line. 
Whatever the point (x,, x2, z3), belonging to the plane ZL, 
may be, its coordinates x,, x, satisfy equation (4) irrespec- 
tive of its third coordinate x;. The equation 


Ay, +B=0 (A, 0) (9) 


is a particular case of equation (4). It can also be written 
in the form 


t,=C (C = —B/A)). (5') 


In space (2, 22, 23), equation (5’) is a locus of points (2, 
XL, X3) whose first coordinate is equal to C, the coordinates 
ty, £3 being equal to any numbers. It is clear that (5’) 
determines a plane parallel to the coordinate plane 2%, 23 
(or perpendicular to the axis 2,). 
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Intercept form of the equation of a plane. If A; = 0 
(ij = 1, 2, 3) and B ~£ 0, then equation (2) can be written 
in the following way: 


Ly 5 te 
Gate ©) 
where a; = — B/A; (i = 1, 2, 3). Equation (6) is called 


the intercept form of the equation of a plane. This plane 
(see Fig. 21) intersects the axis Oz, at the point (a,, 0, 0), 


Fig. 21 


the axis Ox, at the point (0, a,, 0), and the axis Ox, at the 
point (0, 0, as). Judging by equation (6), it is easy to 
imagine the position of the plane relative to the coordinate 
system. 

The equation of a plane passing through a given point. 
Ii the point x° = (z], x}, x3) lies in plane (2), then its 
coordinates satisfy equation (2): 


3 
| Aj;x?+ B=0. (7) 
i=1 
Subtracting (7) from (2), we get 
3 
Dd) A; (4; —2j) = 0. (8) 
i=1 


Equation (8) is known as the equation of a plane passing 
through the point x° = (x2, x, 23). In terms of vectors, 
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equation (8) has the form 

A (x — x°) = 0. (8’) 
Since the vector x — x°, applied at the point x®°, belongs 
to plane (1), equation (8’) implies that the vector A is 
orthogonal to plane (1), which was ascertained before 
from other reasonings. 

The equation of a plane passing through three given 
points. Let there be given three noncollinear points 
(i.e. not lying on a straight line) 

xi = (a4, 13, 24), 

x? == (xi, 2, 23), 

x* = (z?, x2, 2). 
It is required to write the equation of the plane passing 
through these three points. We know from geometry that 
there exists such a plane which is unique. Since it passes 
through the point x!, its equation has the form 


A, (4%, — x1) + Ag (%_ — 5) + As (tz — 23) = 0, (9) 


where A,, Az, Ag are not all zero. Since it also passes 
through the points x?, x®, the following conditions must 
be fulfilled: 


ee ee es - 
Ay (a2 ~ 21) + Ay (23 — 21) + Ay (x3 — 23) =0. (10) 


Let us set up a homogeneous linear system of equations 
with respect to the unknowns (2,, Z», 23): 

(x4— a) 24+ (22 — x5) Z_ + (13— 25) Zz, = 0, 

(x? — xh) 24+ (x5 — £4) 2. + (x3 — 75) 23 = 0, (11) 

(xj — 2) 24 -+ (% — 2p) 2, -+ (73 — 25) 29 = 0. 
Here x = (%1, Ye, 23) is an arbitrary point satisfying 
equation (9). By (9) and (10), system (11) is satisfied by 
a nontrivial vector A = (A,, A>, A;), therefore its deter- 
minant is zero. 

ty—2,  Iyg—T, F328; 
gaz gi—z 2—x,|=0. (12) 
1 8 1 | 
2 2 


3 3 1 
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We have obtained an equation of form (9), i.e. an equation 
of a plane which is easily verified by expanding the ob- 
tained determinant in terms of the elements of the first row. 
And this plane passes through the points x!, x”, x®, which 
follows from the properties of determinants. Our problem 
has been solved. 

Equation (12) can also be written in the following form: 


ti By By 4 

rn ee ae | 

ae ef gt = 0. (13) 
1 2 3 
ae 

a te SS 


If the second row is subtracted from the first, third, and 
fourth rows, then determinant (13) remains unchanged. 
Expanding the determinant thus obtained in terms 
of the elements of the fourth column, we get equation (12). 

The angle between two planes. Let us specify two planes 


A,x,+ Apt, + A,x3+ B=, 
Ait, + A,r, + Ajt3-+ B’ =0. 
We know that the vectors A = (A,, Az, As) and A’ = 
== (Aj, Aj, A;) are perpendicular to the given planes, res- 
pectively, therefore, the angle m between A and A’ is 


equal to the angle (dihedral) between the given planes. 
But the scalar product 


AA’ = |A][A’ | cosqQ, 


(14) 


therefore 
AAT A, Al + AgAt + A5 A$ 
FATT A'| Vary at Ag V AP Ag? AP * 


It is sufficient to assume that OX p< na. 

Note that two intersecting planes actually form two 
dihedral angles @, and g.. Their sum is equal to x (9, + 
+ gm, = 2), and their cosines are equal to each other by 
absolute value but have opposite signs (cos g,= 
= —cos g,). If we replace, respectively, the numbers 
A,, A,, A;by the numbers —A,, —A;,, —Agz in equation 
(14), then the newly obtained equation will define the 


cos g = 


(15) 
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same plane, but the angle @ in (45) will be changed 
for m — q. 

The condition for perpendicularity of planes (14) will be, 
obviously, cos@m = 0, i.e. 


AA’ = A.A‘ + A,Ai + Az, = 0. (16) 


Two planes (14) are parallel when, and only when, the 
vectors A and A’ (which are perpendicular to them) are 
collinear, i.e. the proportionality conditions are fulfilled: 


- (17) 


If, in addition to this, the following extended condition 
is fulfilled: 


(18) 


then this means that planes (14) coincide, i.e. both equa- 
tions (14) define one and the same plane. 

Though it isimpossible to divide by zero, but it is con- 
venient to write symbolic proportions (17) or (18) with ze- 
ros in the denominators. But in this case, if, for instance, 
A, =0, then we must set A, = 0. Or, if BY = 0, 
then B = 0. 

Examples. The equations 


2x + 4y +1 = 0, 
xr+2Qy+5=0 
determine a pair of parallel planes, and the equations 
2x 4+- 4y + 2 = 0, 
x+2y+i1=0 
determine a pair of coinciding planes. 
The distance from a point to a plane. It is required to 


find the distance from the point x° = (z], «5, x9) to the 
plane LZ defined by the equation 


A,2, + A,X» + As3k5 + B= 0. 


To this effect, let us reduce the equation of LZ to the nor- 
ma .form: 


(x, v) = (p = 9). 
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It is seen from Fig. 22 that the difference x — x°® between 
the radius vector of an arbitrary point x belonging to the 
plane £ and the radius vector of the point x° is a vector 
such that the absolute value of its projection on v is equal 
to the required distance d from x° to L: 
d = | (x — x®, v) |, 
but 
(x — x°, v) = (x, v) — (x’, v) = p — (x9, v). 


Consequently, 
d = | (x, v) — p |. 


We see that, in order to compute the distance d from the 
point x° to the plane Z, we have to write the equation of L 


Fig. 22, 


in the normal form, transpose p to the left-hand meuiber, 
and substitute x® for x. The absolute value of the obtained 
expression is just the required number d. 

It is obvious that in terms of the parameters of the 
plane, 


ey 


V AR+ ARs AB 
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It is easy to see that if the point x° and the origin are 
situated on different sides of the plane Z (as in Fig. 22), 
then the vector x — x® forms an obtuse angle with v and, 
therefore, 


d= — (x — x®, v) = (x®, v) —p > 0. 


But if the point x° and the origin are found on one side 
of L, then an acute angle is made, and 


d = (x — x®, v) = p — (x°, v) > 0. 
Consequently, in the first case (x®, v) > p, and in the 
second (x®, ») <p. 


Example 1. The distance d from the point (1, 1, 1) to 
the plane L 


r+2y+2=3 
is equal to 
2+2y+z—3 an oe 
Vt+441 |eatpyei,zi V6" 


In this case the point (1, 1, 1) and the origin are situated 
on different sides of the plane Z since M > 0 and 


fe + 24y +2 — 3] x=1,y=1, 2=1 1 > 0. 


Problems 

1. Reduce the following equations of planes 

e—yt+sz=2, 22 —y + V 202 = 10 

to the normal form. 

2. Find the angle between the two planes: 

(a) ce tytz=1 and y = 0; 

b)xz—ytz=2 and «+tyt+2=3. 

3. Write the equation of the plane passing through the points 
(0, 0, 1), (4, 0, 4), (1, 1, 0). 

4, Write the equation of a spherical surface with centre at the 
origin, tangent to the plane 2x — 6y + 7z = 2 


Sec. 10. A Straight Line in Space 


Consider in space an’arbitrary straight line 2. Mark on 
it a point x° and a vector a = (a, @,, a3) = 0, applied 
at the point x° (Fig. 23). An arbitrary (moving) point of 
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the line Z will be denoted by x. The vector x — x® can be 
written in the form x — x® = ta, where ¢ is a certain 
number (scalar). If the :eal variable ¢ runs through the 


Fig. 23. 


interval (—oo, oo), then x = x® + #a traverses tlie entire 
length of Z. Therefore, the equality 


x—x*°=ta (~—~wo<t< oo) (1) 


is said to be a vector equation of the straight line passing 
through the point x® in the same direction as a. 

When written in terms of the coordinates, equation (1) 
decomposes into three equations: 


Ly — XL, = tae, (1’) 


3 —2,= tas. | 
Eliminating the parameter t, we get the equations of 
a straight line (a system of two equations) 


Mia 7y Sq HQ gh 1”) 
ay Gs ag 


where the numbers a,, @,, a3 are not all zero. Equations (1) 
are called the equations of a straight line in the canonical 
form. 

Remark. It may happen that one or two of the numbers 
@, Ay, Ag are equal to zero. Then it is still customary to 
write equations (1") with a zero or even two zeros in the 
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denominators. Such notation then becomes symbolic, but 
it 1s convenient. 
Example 1. The equations 


z—t  y-—2 __ 2-8 
| re: (a (2) 


define a straight line in space passing through the point 
(1, 2, 3) in the direction of the vector (1, 0, 2). 


These equations can be replaced by the following equi- 
valent ones: 


y—2=0-(¢—1), 2(@—1)=2—8, 


eS 2 eS 2rd. (2°) 


Thus, the straight line under consideration is an inter- 
section of two planes defined by equations (2’). 
Example 2. The equations of a straight line 


z—1  y—2 2-8 


{ 2 
are equivalent to the following ones: 
y—2=0, z2-—3=0. 
Let there be given equations of two planes 
A,zx+ Any + Age t+ B=), (3) 
Ait + Aly + Alz + BY =0. (4) 


If the coefficients A; of the first of them are respectively 
proportional to the coefficients A; of the second (A, : A,: 

> Ag = A, :A,:A;), the planes (3) and (4) are parallel 
or even ‘coincide (provided that A,:A,:A,:B= 
= Al: Aj: A;: B’) (see (17) and (18) from the preceding 
section). Otherwise. (3) and (4) intersect along a straight 
line. In this case one of the determinants 


A, A,|’ |A, A; A, A; 
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is not equal to zero. For the sake of definiteness, let us 
assume the first one: 


A, A, 
A, A, 


Then equations (3), (4) can be solved with respect to x 
and y, and we obtain 


r= aZ+ iL, ! 
+) 
y=Pz+v, o 


where a, 8, u, v are certain numbers. Equations (5) are 
equivalent to the following ones: 


+= (0. 


L— ph y—v 2 


a B ame (6) 


We see that the straight line defined by equations (3) 
and (4) passes through the point (1, v, 0) in the direction 
of the vector (a, B, 1). The numbersa, f or one of them 
may be equal to zero, then equations (6) will have a sym- 
bolic character. 

Example 3. The straight line defined by the equations 
x= 0, y = 0 is, obviously, the z-axis. This result can 
also be obtained formally. We have 


x=O0-z, y=O-z, 
whence 


x y z 


0 
i.e. we have obtained the equations of the straight line 
passing through the origin (0, 0, 0) in the direction of 
the vector (0, 0, 1). It is clear that this line is just the 
Z-axis. 

Example 4. Find the angle between the straight lines 


re | ate = 
ty tg %Q iB xk 
_——————— SS ea oe et Se 


= (7) 
ay a9 ag 
Ty — x} be Log— 23 _ t_g— x3 


pha net ose 8) 


The vectors a = (a1, @., a3), b = (01, be, D5) lie on these 
lines and, according to our agreement, they are applied 
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9 


at the points xt = (x), x}, xl), x? = (ci, x{, x2). Therefore, 
the angle @ between these vectors will be just the angle 
between straight lines (7) and (8): 


__ 4161+ Gaby + gb 
08 fallbl 


Problems 


1. Write the equation of the straight line passing eee the 
point (2, —1, 0) perpendicular to the plane 22 + z — 4y = 7, 

2. Write the equation of the plane passing through the point 
(1, —1, 2) perpendicular to the straight line defined by the equa- 
tions In + By = 4, 3z —2= 1. 


Sec. 11. Orientation of Rectangular 
Coordinate Systems 


Figures 24 and 25 represent two different coordinate 
systems (Z,, Z,). They are said to be oriented in an opposite 
way. In the case presented in Fig. 24 it is possible to 
bring to coincidence the directions of the axes x, and 2, 


x4 
X9 
b A 
A b 
a a 
: a 
| aS 9 - 


Fig. 24. Fig. 25. 


by turning the axis z, about the point O through an angle 
of x/2 anticlockwise. In the case of Fig. 25 this can be 
achieved only by rotating the axis z, clockwise. It is impos- 
sible to bring to coincidence the coordinate system repre- 
sented in Fig. 24 with that depicted in Fig. 25 (so that the 
directions of their corresponding axes coincide) by dis- 
placing it in the plane under consideration (!) as a solid. 
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Figure 24, as well as Fig. 25, shows a pair of noncollinear 
vectors a and b emanating from a certain point A. Dis- 
placing this pair as a solid in the plane, let us bring the 
point A to a position in which it coincides with the ori- 
gin O. Let us set ourselves the task of bringing the vector a 
into the direction of the axis z, and aligning the vector b 
with the axis z, by rotating either of the mentioned vect- 
ors about the point O. It is required that during this proc- 
ess the vectors a and b be situated all the time in the 


Fig, 26. 


plane under consideration and that the angle between them 
be not equal to 0 or x. Obviously, this can always be 
achieved. First we rotate the system of vectors a and b as 
a solid about the point O to bring the vector a into coinci- 
dence with the positive direction of the axis z,. Since 
the vectors a and b are not collinear, the vector b will be 
found either in the upper, or lower half-plane. Then we 
turn the vector b through an angle sufficient for b to be 
brought to coincidence with the axis x,, b being not al- 
lowed to get onto the axis z,. Therefore, it may happen that 
the direction of the vector b will coincide with that of the 
axis z, (it is quite possible when b was situated in the 
upper half-plane) or the direction of b will turn out to 
be the same as the negative direction of the axis z, (see 
Fig. 26 illustrating this process). In the first case the 
pair of vectors (a, b) is said to be oriented in the same way as 
the. coordinate system (x,, Z_), while in the second case the 
pair (a, b) is oriented in an opposite way. 


80 Fundamentals of Linear Algebra and Analytical Geometry 


The rectangular coordinate systems 7,, 72, £3 in space 
represented in Figs. 27 and 28 are also different. Consid- 
ering the coordinate system shown in Fig. 27 as a solid, 
we come to a conclusion that, displacing it in an appro- 
priate way, it is possible to bring to coincidence the axes 
zr, and x, of both coordinate systems. But the positive 


Fig. 27. Fig. 28. 


direction of the axis xz, of the first system does not 
coincide with that of the second system. The systems 
depicted in Figs. 27 and 28 are said to be oriented in an 
opposite way. The system of Fig. 27 is called the left- 
handed coordinate system and that of Fig. 28 the right- 
handed coordinate system. If a screw with a right-hand 
(left-hand) screw thread is driven in the direction of 
the axis «3, being turned in the direction indicated by the 
arrow in Fig. 28 (Fig. 27), then it will perform transla- 
tional motion in this direction. We can also distinguish 
a coordinate system according to the following rule. 
If a viewer located at a certain point of the positive 
semiaxis Ox, looks at the positive semiaxis Oz,, then the 
positive semiaxis Oz, may be found directed either left- 
ward, or rightward. In the former case the coordinate 
system is called left-handed (Fig. 27), while in the latter 
case it is called right-hauded (Fig. 28). 

Vectors a, b, e are called coplanar if they lie in one 
plane or are found in parallel planes. 

Let us take a system of noncoplanar vectors a, b, ¢ 
applied at a certain point A and rotate the vector b 
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about the point A in the plane of the vectors a and b, 
until it turns out to be perpendicular to a. Make sure 
that during this rotation the angle between a and b is 
never equal to 0 or x. Then we shall rotate the vector c 
about A to make it perpendicular to the vectors a and b. 
Pay attention that the vector ec never coincides with the 
plane of the vectors a and b. Asa result, the vectors a, b, ¢ 
will turn out to be mutually perpendicular. Let us now 
transfer this triple of vectors, asa solid, to the point O and 
rotate it about this point in order to make the vectors a 
and b coincide with the directions of the axes 7, and Zp», 
respectively. Two cases are possible here: (1) the vector ¢ 
will be in the same direction as the axis 73; (2) it will be 
in the opposite direction. In the first case the system of 
vectors a, b, ¢ is said to be oriented in the same way as the 
coordinate system z,, Z,, X3, while in the second case it is 
said to be oriented in an opposite way (see Figs. 27 and 28, 
respectively). 


Sec. 12. The Vector Product of Two Vectors 


Let us specify in some rectangular system two vectors 
a = (a), 4, a3), b = (0,, 5, Dy). 
The vector 
c=a X b=[a X b] = (a,b, —a,0,) i 


ij k 
++ (@3b, — 4403) J 4+ (a4, — agb,)k =| a, ay ag}. (1) 
b, by dg 


is then called the vector product of a and b. 

The last term of the above written chain is a “general- 
ized determinant” whose first row consists of the vectors 
i, j, k (basis vectors of the coordinate system). The second 
“link” of the chain shows how to understand this gener- 
alized determinant (we expand the determinant of the 
third order in terms of the elements of the first row so as if 
i, j, k were numbers). 

It is obvious that [a X (— b)] = — [a x Dl. 

It is also possible to define the vector product of two 
vectors a and b in the following manner: 


6—01496 
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(4) if a and b are collinear, then their vector product is 
equal to zero; 

(2) if a and b are noncollinear, then the vector ¢ is per- 
pendicular to a and b so that the system (a, b, c) turns out 


Fig. 29. 


to be oriented in the same way as the given coordinate 
system. The length of the vector e is equal to 


je] = Jal: |bl]sino Oxon), (2) 


where is the angle between a and b, i.e. the length of ¢ 
is equal to the area of the parallelogram constructed on 
the vectors a and b (Fig. 29). 

Let us prove that the two above formulated definitions 
are equivalent. 

If the vector ec = 0, then it follows from (1) that the 
components of the vectors a and b are proportional 


fy i dg} dg = 6, : be: dg, 


i.e. a and b are collinear, but then the vector product is 
equal to zero by the second definition as well. Conversely, 
if a and b are collinear, then, by the second definition, 
a X b = 0. Since the components of the vectors a and b 
are proportional here, then, according to the first defini- 
tion, ec = 0. 

Let now a and b be noncollinear vectors and ¢ their 
vector product according to (1). It is obvious that 


(ec, a) == (a,b ——™ A305) ay + (a,b, = Q53) Bo 
++ (@b2 — 42b,) a3 = 0 
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and, similarly, 
(c, b) = 0. 


Hence, the vector ¢ is perpendicular to a and b. 

Let us prove that the system of vectors (a, b, ¢) is 
oriented in the same way as the coordinate system (2, 
Zo, X53). Let us continuously rotate the vectors a and b 
about the point O, each time computing by them the vec- 
tor c, all the time keeping a and b noncollinear. But then 
the vector e all the time will be not equal to zero (c ~ 0) 
and the system (a, b, e) will be noncoplanar all the time. 
Let us rotate the vectors a and b so as to align them with 
the directions of the axes z, and z,, respectively, i.e. so 
as to bring them to the form a = ( {a ], 0, 0), b = (0, 
| b |, 0). This can always be achieved, since in this case the 
plane containing the vectors a and b can rotate in space. 
But then the vector c computed by formula (1) has the 
form c = (0, 0, | a | | b |). We see that at the last instant 
of our process the vectors (a, b, ec) are oriented in the same 
way as the axes (2,, Z_, 73). But then, according to the 
definition of orientation (see Sec. 11), the initial system 
(a, b, ec) is also oriented in the same way as the coordinate 
system (%,, X_, Zz). 

Thus, the vector product c = a X b, defined by formu- 
la (1), is a vector perpendicular to the vectors a and b, 
and the system (a, b, c) is oriented in the same way as 
the considered coordinate system (2,, 22, 23). 

Now we are going to prove formula (2). Let the vectorsa 
and b form with the coordinate axes angles respectively 
equal to a, @,, @; and B,, B., Bs. Since 


aj =|aj|cosa; 6b; = |b]| eos Bj G = 1, 2, 3), 
then 
Jc |? = (@2b3 — agbe)” + (a3b, — a463) 
+ (4,52 — ab,)? = |a |? | b [? [(cos a, cos By 


— COS @z cos B,)? 
-+ (cos a; cos B, — cos a, cos ps3)” 


+ (cos Q, COs Bo — COs Kh COs B)?] 
6% 


84 Fundamentals of Linear Algebra and Analytical Geometry 


= |a |? |b |? [(cos? a, + cos? a, + cos? as) 
x (cos? B, + cos B, + cos® By) 
— (cos a, cos B, + cos a, cos B, + cos a3 cos f5)7) 
= lal? |b? [4-1 — cos? a] = [a [? |b  sin® o, 


where w is the angle between the vectors a and b 


3 
(cos w = py cosa; cosB;). 


Thus, we have proved (2). 

Hence, we completely proved that from the definition 
of the vector product by formula (1) there follows its 
second definition. 

Conversely, if a vector obeys the second definition, then 
it is unique, since there can be only one vector perpen- 
dicular to a and b whose length is equal to the area of the 


Xo 
Lo 
a 
O x4 
X3 
Fig. 30. Fig. 34. 


parallelogram constructed on a, b, such that the system 
(a, b, c) is oriented in the same way as the system (7, Xo, 
Z3). But then it is the vector ec defined by formula (4), 
since the latter, as we saw, possesses the mentioned 
properties. 

We should like to note once again that the condition 
a X b = 0 is a necessary and sufficient condition for the 
collinearity of the vectors a and hb. 

Now consider two nonzero plane vectors 


a= (a, dy), b= (bj, bs) (3) 
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in a rectangular coordinate system (z,, z,) (Figs. 30, 31). 
Regarding the plane under consideration as given in 
space, let us add to the axes z,, +, an axis %3 which is 
perpendicular to them. The vectors a, b will then have 
the following coordinates: 


a= (a4, a, 0), b= (B,, be, 0). 
Their veclor product is equal to 
Gy a, 
by 
where k is a basis vector of the axis x3. According to the 
definition of the vector product, the system (a, b, ¢) is 


oriented in the same way as the coordinate system (7,, 22, 
X%3). Therefore, obviously, if the determinant 


e=axh= k, (4) 


ay Ay 


> 0, 


then the system of vectors (a, b) must be oriented in the 
way the coordinate axes (z,, 7.) are oriented. And if the 
Jeterminant 

a, Go 


by |b, 


then the system (a, b) has an orientation opposite to that 
of (z,, 2). The first arrangement of the vectors (a, b) is 
shown in Fig. 30, the second in Fig. 34. We also know 
that the area of the parallelogram constructed on the 
vectors a and b is equal to (see (4)) 


<Q, 


a, ay 
b, Oy 


9 


i.e. to the absolute value of the determinant 
a, Qo 
b, b, by (9) 


Thus, we have proved that: (1) the absolute value of 
determinant (5) is equal to the area of the parallelogram 


A= 
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constructed on the vectors a = (a,, a) and b = (by, b,); 
(2) the sign of determinant (5) depends on the arrangement 
of these vectors relative to the rectangular coordinate 
system (z,, z,). To the plus sign there corresponds a sys- 
tem (a, b) oriented the same as (7, z,.), and to the minus 
sign there corresponds a system (a, b) oriented in an 
opposite manner. 
The following properties hold true: 


ax b= —[bxal, (6) 
a x (ab) =a [a xb], (7) 
aX (b-}-c) = [a xX b] 4 [a x €], (8) 


where a, b, e€ are arbitrary vectors, and @ is a scalar. 

Expressing the vector products entering into equations 
(6), (7), (8) by formula (1) in terms of the components of 
the vectors 


a= (a, Qo, Qs), b= (O,, bo, bs), as (cy, C9, C3), 


we easily obtain these equalities. 

Formulas (6) and (7) readily follow from geometrical 
considerations as well. Let a and b be noncollinear vectors. 
If a and b are interchanged in the vector product, then’ 
both the area of the parallelogram constructed on a and b, 
and the perpendicular to a and b will remain unchanged. 
Only the direction of ec = a X b will reverse thus chang- 
ing the orientation of (a, b, a X b). 

The multiplication of the vector b by a positive number 
a enlarges a times only the area of the parallelogram con- 
structed on a and b, the direction of the vector product 
remaining unaltered. But ifa <0, then 


a X (ab) =a X (—|a |b) =|a | [a x (—Db)] 

= —|a]|la X b] =ala X bi. 
One more remark: it also follows from (6) and (7) that 
(aa) X b = — [b X (aa)] = — a[b X al] = ala x b]. 


Example {. Determine the angle A of the triangle ABC 
with the vertices: A = (1, 2, 3), B = (2, 2, 2), C = 
= (1, 2, 4). 
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Let us denote the required angle by w, which is actu- 


— —> 
ally the angle between the vectors AB and AC. By the 
second definition of the vector product, we have 


—> —> 

_ «| ABX AC | 

sin ® = — J? 
| AB || AC | 


—_ —> —> _ 
where AB=(1, 0, —1), AC=(0, 0, 1); [| AB)=V2, 
—> 

| AC |=14, 


> — > 9 
ABxAC=|1 O —1|/=-—j.- 
0 0 
Hence 
; oe _ « _ 38m 
ey 2 ae Os @ = i 


as —> —_> 
Since BC = (—1, 0, 2) and | BC P =5> [AC P+ 


a 
+ |AB/?=1+2 = 3, it is necessary to take o = 
= 3n/4. 

Remark. If in the triangle ABC the angle A is a right 


—s 
one, then | BC |? = BC* = AC* + AB*; if A is obtuse, 
then BC? > AC? + AB’, if A is acute, then BC? < 
< AC* + AB?. 


Sec. 13. Triple Scalar Product 


The triple scalar product of three vectors a, b, ¢ is a scalar 
equal to the scalar product of the vector a X b by the 
vector Cc: 


(a X b) ¢= (a,b; — @gbq) Cy + (ag, — @403) Co 
Qy 4, G3 
“| (@40_ — Agb,) ¢3 = b, bg 053}. (1) 


Cy Co Cg 
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By virtue of the definition of a scalar product 
(a X b)e = |a X b| prayne = ( [al- [b] sin) praxne. 


Therefore we may, obviously, assert that the triple prod- 
uct (a X bje is equal to the volume of the parallelepiped 
constructed on the vectors a, b, ec. The latter is taken with 
the sign + or — depending on whether the system of vec- 
tors a, b, ¢ is oriented as the coordinate system 2,, Zo, 23 
or in an opposite way. Note that | praxp,e | is equal to the 
altitude of the parallelepiped. 
There exist the following equalities 


(a X b) e = (e X a) b = (b X e)a, (2) 


which easily follow from the properties of determinant (1). 
If the vectors a, b, ¢ lie in one and the same plane, then 


(aX b)e = 0, 


since a X b is perpendicular to the vector c. Conversely, 

if (a X b) c = 0, then the vector c is perpendicular to the 

vector a X b and, consequently, lies in the plane con- 

taining the vectors a and b or in a plane parallel to this 

plane. . 
Thus, the condition 


(a X bhe = 0 


is a necessary and sufficient condition for the coplanarity of 
three vectors a, b, e. 

Example i. Find the condition for belonging of four 
points to one plane. 

Let there be given four points A; = (2;, y;, 2;) 7 = 4, 
2, 3, 4). If these points lie in one and the same plane, then 

—_> -——>>— > 

the vectors A,A,, A,A3, A,A, also lie in this plane and, 
consequently, their triple product is zero: 


etnies 6 asad, Lo—L_ Yo—Ys1 2— 2 
(A,A, X A,As3) A,A, =|2%3—2%, Y3—Yy 23-24 = 0. 
Le—Xy Yu Uy Fy 


This is just the condition of belonging of four points to 
a single plane (cf. (12) from Sec. 9). 
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Sec. 14. Linearly Independent System 
of Vectors 


Let us specify in R, a system of k vectors 
a® oe (Qs, Agar + + 44 Zsn) (s = 1, os Shy k), (1) 


System (1) is defined as linearly independent if from the 
vector equation 
A, a2 + roa +- ee + A,a® ae 0, (2) 
where A,, Ao, ..., Ay are numbers, it follows that 
iS he = os Se 
System (1) is called linearly dependent if there exist num 
bers A,, A., .. ., A, not all equal to zero, for which equa- 


tion (2) is fulfilled. If, for the sake of definiteness, we as- 
sume that A, 54 0, then it follows from (2) that 


a® = pat +... ++ aya’, (3) 
where 
r NE 
B= Go BR = i 


Thus, if a system of k vectors is linearly dependent, then one 
of them is said to be a linear combination of the remaining 
vectors, or, in other words, one of them depends on the 
remaining vectors. 

Since all the time we shall speak of a linear dependence, 
the term linear will sometimes be omitted. We shall also 
speak of dependent and independent vectors instead of 
dependent or independent system of vectors. 

One vector a! also forms a system which is linearly inde- 
pendent if a‘ 0 and dependent if at = 0. 

If an entire system of vectors a!, ..., a® is linearly in- 
dependent, then any part of thissystem a!,..., a®°(s < k) 
is linearly independent all the more. Otherwise, we could 
find a nontrivial system of numbers i,, .. ., A, for which 


Ajai... +t A,a® = 0, 
would be fulfilled. But then for the system 4,, ..., Ag 


kh—s times 
e a 
0, .... 0 which is also nontrivial, the following would 
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hold true 

Matt ...+ A a®+ O-att?+ .. 4 0-a® = 0. 
Hence it follows that if a system of vectors a?, . . ., a° is 
linearly dependent, then any augmented system 


8 s h 
Bo bch a SA Py ede OR 


possesses the same property. In particular, a system of 
vectors containing a zero vector is always linearly depen- 
dent. 

Let us form a matrix determined by the vectors of 
system (1): 


Ary Ang---Akn 


Theorem 1. Jf rank A =k, i.e. rank A is equal to the 
number of vectors, then system (1) is linearly independent. 

But if rank A <k, then system (1) is linearly dependent. 

Example 1. Two vectors at = (@,;, @.), a? = (@9), @29) 
in R, form a linearly independent system if the determi- 
nant 


~ 


Ay, Ai 


#0, 


Qo, Ae 
since the vector equation 
A,a3 + 2a” = 0 (4) 


is equivalent to two equations for the corresponding com- 
ponents 
yyy + Agydg = 0, (5) 
yyy + Agodg = 0. 
But if A 40, then system (5) has a unique trivial solution: 
hy == ho = 0; (6) 


And if A = 0, then equations (5) are satisfied by a certain 
nontrivial system (A,, A), i.e. for A =O the system of 
vectors a}, a’ is linearly dependent. 

Obviously, to say that in R, the vectors a’ and a® are 
collinear is the same as to say that they are linearly de- 
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pendent. But then to state that the vectors at and a? are 
not collinear is the same as to say that they are linearly 
independent. 

Example 2. A system of vectors al, a?, ..., a® (k> 3) 
in R, is always linearly dependent. Geometrically this 


—- Ewe EE a ae ae 


Fig. 32. 


is Clear from Fig. 32: if ¢ is an arbitrary vector and a, b 
are noncollinear vectors, then we can always indicate 
numbers A, p such that 


c = da + wb. 


This shows that the system a, b, c is linearly dependent. 
But if a and b are collinear vectors, then they are linearly 
dependent. And a, b, e are linearly dependent all the more. 

According to Theorem 1, to investigate the pair of. vec- 
tors at, a?, we must write the matrix formed from their 
coordinates 


Gig Ato 
Qo, Ano 


Ju the given case k = 2. 

(a) lf rank A =1<2 =k, then the theorem states 
that the vectors a!, a” are linearly dependent. 

(b) If rank A = 2 =k, then the vectors a!, a® are 
linearly independent. 

This coincides with the above conclusions, since in the 
case of (a) A = 0 and (b) A 0. 

The fact that three arbitrary vectors a!, a*, a® in Ry are 
linearly dependent is also stipulated by the theorem. As 
a matter of fact 


rank AX 2<03 =k. 
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Example 3. In three-dimensional space R; two vectors 
at = (44, Gyo, 13), A” = (Aq1, Azo, Ae3) 


are linearly dependent when, and only when, they are 
collinear. 

Indeed, let a‘, a? be collinear. If one of the given vectors 
is zero, then they are linearly dependent. But if a‘ and a? 
are collinear and nonzero, then 


al = ia’, 


where A is some number. This means that a', a? are linear- 
ly dependent. 

Conversely, if a’, a? are linearly dependent, then one of 
them depends on the other, for instance, 


i.e. the vectors are collinear. 
Consider the matrix 


Gy4 Ayn 43 || 
A= ‘ 


Goi Ae, Aas 
Since the elements of its rows are proportional, then 
rank A=1i<i2= kh, 
i.e. our assertion conforms to Theorem 1. 
Example 4. Let us now consider three vectors in fs: 
al = (444, Qi, G15); 
a? = (@o4, A221 23); 
a? == (31, G32, @33)- 
Let 
Aig Ayg 843 
A=]@o_ Gog Gag|e 
As, A32 433 
The vector equation 
Ayat + Aa? + Aga® = 0 (7) 
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is equivalent to the system of three equations 
Ayhy + Aghe + A343 = 0, 
Ayohy + Agohe + Ag2h3 = 0, (7) 
Aygh4 + Agghe + Ag3hg= 0. 


If A=40, then system (7’) has a unique trivial solution 
Ay = A, = Ag = OV. But then equation (7) also has a unique 
trivial solution and the system of vectors (a', a?, a%) is 
linearly independent. 

If A = 0, then system (7’) and, consequently, equation 
(7) have a nontrivial solution (A,, 4, 43). But then the 
system of vectors (a1, a”, a°) is linearly dependent. The 
following peculiarities are distinguished here: 

(1) Let rank A =1, where 


Ay, Ayn Ayg 
A=|]@o4 gq Ag 
A234 23. Ags 


Then at least one of the rows of A, let, for the sake of 
definiteness, the first one, has at least one nonzero ele- 
ment. Consider the matrix 

aq a a 
val” 12 M431) (8) 
It has rank 1, therefore all the second-order determinants 
generated by this matrix are equal to zero 


Qo, Gog 93 


A344 Ayn} | Gag 43 Q42 43 


= Q. 


Ao, Ao As, og Qo, gg 


But then, obviously the components of the vectors a! 
and a® are proportional 


Go, _ G2 __ Gog = 
Saat tee oe ay ee fa seas +) 
ait 12 13 
i.e, 
oy = AyA, Gog = Ayghy og = Ay gh 
or 
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Analogously, taking into account that in matrix 


Qiy Qing gy 


Q3_ G30 G33 


all the second-order determinants are also zero, we get 


ae 


= wal, 
where » is some number. Hence, in this case the vectors 
al, a?, a® are collinear. 

(2) Let now rank A = 2. Then one of the matrices 
consisting of two rows of the matrix A has rank 2. Let, 
for the sake of definiteness, this be matrix A’ (see (8)). 
On the basis of example 3, the vectors a! and a? are linear- 
ly independent. But the system a!, a’, a? is dependent, 
i.e. for a certain nontrivial triple of numbers (Aj, A., Ag) 


Ajat + A,a? + Aza? = 0. (9) 


Here A, 0, since otherwise 4,a' + A,a® = 0, and, by 
virtue of independence of the system a!, a? we would have 
A, =A, = 0. But then equation (9) can be resolved 
with respect to a?: 


nN ny 
ae=pyal+ pa, by = ae ae Le = mc oe 
3 3 


Thus, if A = 0, and rank A’ = 2 (see (8)), then the 
vectors a! and a’ are noncollinear, and the vector a?® 
belongs to the plane containing these vectors. 

The above reasoning does not contradict theorem 1. 
Indeed, if A 0, thenrank A = 3 = k and, by theorem 1, 
the system of vectors a}, a’, a® is linearly independent. 
And if A = 0, the rank A < 3 and the system of vectors 
a', a’, a® is linearly dependent. 

Proof of Theorem {,. Vector equation (2) is equivalent 
to the following n equations for the vector components: 


AiQyy+ AgGor+ ... + An@ar =D, 
AyQye + AgQan-+ ... + An@ne = 9, (2’) 


eo 8 8 8 © w© @ @ @ #  @ j@  @ @®  e@  @#  @  @ 


AiQin +Aodon +... +An@an = 0. 
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Let rank A =k. Then, obviously, k< n. There exists 
a nonzero determinant of order & generated by matrix A. 
Without loss of generality, we may regard that this is 
a determinant made up from the coefficients of the first k 
equations of system (2’): 


Aydyy eee + Anny =0, ! 


eo 8 © © © &@ eB eo ee lel (10) 
Asdyn t+ .+- + ApnQan = 9, 
Oy, -+. Ap, | 
° © © 6 «@ ~=(). 
Qip +++ Apr 


Since homogeneous system (10) hasa nonzero determinant, 
it has only a trivial solution 


Me ha ee eh = 
and system of vectors (4) is linearly independent. The 
statement of the theorem has been proved for this case. 


Let now rank A =s<k. We renumber system (2’) 
and variables A; so that 


Then the system of the first s equations can be written 
as follows: 


© @ 8 @ @ @ 8 @ @ @  @ @  @  @©  @®  @®  @®  @  @  @  e#  @  @®  @  @ 9 ® 


Q4sA4 + owe + AsaNe = — Ages, s\s44— o258° DS AnsAp- 
Setting Aga, =... =A, = 1, we can solve the given 
system with respect to 4,, ..., As. As the result, we 
get a nontrivial solution 

Ris tis 5 (11) 


of the first s equations of (2'). Now any & — s equations 
of (2’) can be added to these s equations, and the newly 
obtained system will have the found nontrivial system 
as its solution all the same. In order to explain this 
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assertion, let us formally write system (2’) in the fol- 
lowing way: 


May, big + Appt Ans: O+ Sits +i1,-0=0, 


Bik Baik dae ciel Oba, Oe “a eg fae Pl, Ws Me Ts ee ee, He ae 2) 


a 


Ay@yn + ows + Npapn + Apa OF ets +h,:0=0. 


We see that the matrix of the coefficients of the obtained 
equations has a rank s, the same as the augmented (by 
zeros on the right!) matrix. The first s equations of sys- 
tem (2°) are satisfied both by the found numbers A,,... 
..., An (see (41)) and arbitrary numbers Ap,+,, . ~~, An- 
By assertion (2) from Sec. 4 (the rules for solving sys- 
tems), thenumbers),, ..., 4, satisfy the remaining equa- 
tions of system (2’) as well, i.e. the numbers A,, ..., Ax 
(not all equal to zero) satisfy the remaining equations of 
system (2). Thus, the vectors at, ..., a® are linearly 
dependent. 

We have proved the theorem under consideration for 
this case as well. 


Sec. 15. Linear Operators 


Let there be given an arbitrary square matrix 
A=|lay |=] occ , (1) 


Matrix A may be regarded as an operator associating 
with every vector x = (a, ..., @,)€ Ry, vector y = 
= (Yj, -++, Yn) € Ra, whose components are com- 
puted by the formulas 


Yo = Agyty tw... $A ln, Y 


Cr (2) 


or, in a briefer manner, 


m= i ayey (U1, (2") 
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i.e. the ith coordinate of the vector y is written with the 
aid of the ith row of A. We shall write this operator in 
a brief notation: 

y= Ax (x, y€ R,) (2") 


In particular, in the case of one-dimensional space R, 
the vectors x and y are numbers, and operator (2”) turns 
into a function: 


Y = Ay. 
Operator (2") is linear. This means that it satisfies the 
conditions 
A (ax + Bx’) = a@Ax + BAx’ 
for any vectors x, x’ € R, and numbers a, fs. Indeed, 


nr 


nT n 
, f ~~! , ° 
a Ai; (@X;y-+ Bri) = & 21 Qj jty +B pa Q,jxy (t= 1, .2., 7). 
j= = j= 


If the matrices A = || ap, || and B = || bz, || are equal, 
i.e. have equal corresponding elements ap; = bz, then 
they determine identically equal operators: 


Ax = Bx, Vx € Ry. (3) 
Conversely, it follows from equation (3) that 
Gat = Op: (A, L= i, oy n), 
i.e. the equality of the matrices A and B (A = B). This 
can be easily verified by setting in (3) 
ee = (07 oe 0 Os ad ae 0); 
where 1 occupies the /th place (J =1, ..., nm). 
Hence, to different matrices A there correspond differ- 
ent operators, if two matrices A, and A, differ from 


each other at least by one element, then there necessarily 
exists a vector x for which 


A,X = Aox. 


Let, in addition to A, there be given another operator 
B defined by a square matrix of order n 
B= || dar |I- 
7—04496 
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To every x € &, there corresponds (due to operator A) 
a vector y € #, to which, with the aid of operator B, 
there corresponds a vector z with the components com- 
puted by the formulas 


n 
2. 2 Oni; (Kent, ...,2) 
7 


As the result, we get a composite linear operator 


z= BAx (x € Rn), (4) 
where 


n nh rt 

@ 4 .1 
Ze = >) Oni = 2) One Dd) O52; 

i=1 i=1 jt 


Tt Tt Te 
= Ba (> by 103) y= Di Vr gry 
j=1 i=1 j=1 


with a matrix | ,; | called the product of matrices B 
and A, and denoted as 


BA = |l Yryq Il, (9) 


where 
Tr 


Pri= pa Datiy (K Jol, oe, M). (6) 
i.e. in order to obtain the element y,,; of the matrix BA 
(belonging to its Ath row and jth column), we have to 
multiply the elements of the Ath row of matrix B by the 
corresponding elements of the jth column of matrix A 
and add the result. 

The determinant of matrix BA is equal to the product 
of the determinants of matrices B and A: 


|BA |= |B| {A |. (7) 
This property follows from the formula for the prod- 
uct of determinants (see Sec. 2, property (j)). 


Let the matrix of operator A (see (1)) have a determi- 
nant which is not equal to zero: 


A = | apy, | 4 0. 


In this case (see Sec. 4, Theorem 1) system of equations (2), 
or, which is the same, the operator equation y = Ax has 
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a unique solution x € R, for any given y € R,. And the 
formulas by which x is found for a given y € A, have 
the form 


ty= >) PUL (j=1, cobs n). (8) 
a | 
Here 
bjs = Ayy/A (8, f =4, 2.45 2) (9) 


(see Sec. 4, (3’)) where Ag, is the cofactor of the element 
a,; in the determinant A. 

However, now it is of importance for us only to note 
that the numbers b;, are elements of the matrix 


B= |j bj. Its 
possessing the following remarkable properties: 
BAx =x, VxER,, (10) 
ABy=y, VyER,. (41) 


Indeed, by means of operator A, an arbitrary vector 
x € R, turns into a certain vector y which, by means of 
operator B, turns back intox. On the other hand, to every 
y € &, there corresponds, via operator B (see (8)), 
a certain x such that Ax = y. 

In equation (11) we may, obviously, replace y by 
another letter, therefore, we have obtained the identities 


BAx = ABx =x, Vx. 


The operator x = Ex (Vx €R,) is called a unit op- 
erator. Its corresponding matrix has the form 


1 0...0 
rl9 s oars 0, 
OF Usiecat 


and is termed a unit matrix. We have proved that 

AB = BA = E. 
Operator B, possessing this property, is called an inverse 
of operator A and is denoted by A-!. Accordingly, its 
1% 
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matrix is also denoted by A. The elements of matrix 
A-! are found by the elements of matrix A with the aid 
of formulas (9). 

We have proved that if the determinant | A | of a 
square matrix A is not equal to zero, then it has an inverse 
matrix A-!. Thus, for A-! the following properties are 
fulfilled: 


AA = AAT= E, 


If the determinant of matrix A is equal to zero (| A | = 
= ()), then it has no inverse matrix. It is sufficient to 
say that the equation y = Ax has a solution not for 
any y. 

Note that the property AA@1y = y, if it is fulfilled, 
states that to every y € R, there corresponds (with the 
aid of operator A~‘) an x such that it is the solution of 
the equation y = Ax. 

Remark. The operation of multiplication of matrices 
can be extended to nonsquare matrices Band A, provid- 
ed that the number of columns of matrix B coincides 
with the number of rows of matrix A. Then the multi- 
plication of matrices is carried out by formulas similar 
to (6). For instance, if 


4.2.3 1 4 
B={910}], A=joO 1], 
0041 1 0 
then 
43 
BA=|0 1 
4 0 


In this case we cannot consider the product AB, since 
matrix A has two columns, while matrix B has three 
rows. 

For square matrices A and B the products AB and BA 
have sense, but AB is by far not always equal to BA. 
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For instance, if 
F 4 ') E ) 
= 1)? ®=\o o} 


, ¢ ') a (; ) 
BENG. qe BN, a) 
i.e. AB=- BA. It is easy to verify that (AB)C = 
= A (BC). 
If A is a Jinear operator, then the notation Ax may be 


regarded as the product of matrix A by a one-column 
matrix 


then 


Let there be given linear operators A and B. Their sum 
is an operator A + B defined by the equation 
(A+ B)x =Ax+ Bx, Vxe R,. 


Obviously, the matrix of operator A + B coincides with 
the matrix equal to the sum of the matrices of operators 
A and B. 


It is easy to verify that 
A(B+C)=AB-+ AC. 


Example 1. Find the inverse of the matrix 


1 2 0 
s-(0 1 ) 
14 0 


Let us denote the elements of the inverse matrix A™! 
by 6;,. We have A~=1, 6;,=As,;, 


14), 04 
1 o[= ts 4e=—|4 9 


0 14 
Ays=|, 1 = —f, 


Ay — 


a 
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2 O 1 0 
An=—, 5|=° 22 >> 4 5/=0. 
1 2 
A= —|i 4 =, 
2 0 1 0 
4 =|, t|=2. An= —|q |= —4, 
1: 2 
4n=|) 4|- 
Hence, 
by = Ay = — 1, bp = Ay =O, dyg = Ag, = 2, 
bey = Ayo = 1, boo = Aga = 0, bo, = Age = — 1s 
bs, = Ayyg = — 1, bgp = Agg = 1, 033 = Aggy = 1. 
Thus, 


—1 Q 2 
| 4 QO -*) 
—1 1 4 


Sec. 16. Bases in Rk, 


In space R, let us introduce n vectors 


(1) 


im=(0, 0, ..., 0, 1), 


called basis vectors of the axes of the Ry, space. 

The azis x, of the R, space is defined as the set of 
points of the form (0, ..., 0, z,, 0, ..., 0) where z, 
occupies the kth place and runs all real values, and the 
vector i* is called the basis vector of the axis z,. 

If a = (a, .. ., 2,) is an arbitrary vector in AR,, then 
it can, obviously, be written in the form of a linear com- 
bination of vectors (4) as follows: 


a= xi" +2,i?%+...+ 2,3". (2) 
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Since it follows from the equationa = (%,...,2%,) = 0 
that 7, =...=2, = 0, the system (i', ..., i”) i 
linearly independent. 

Let there be given an arbitrary system of xn linearly 
independent vectors 


al = (dy, eeeg Bin), 
_ e e e e eee 8e8 @ @ (3) 
a = (Qna> oa eg Ann) © 


As we know (see Sec. 14, Theorem 1), system (3) is lin- 
early independent if the determinant 


8 (0). (4) 


an, eee Ann 


And if A = 0, then system (3) is linearly dependent. 
According to Theorem 1 from Sec. 14, any n + 1 vec- 
tors in the A, space are linearly dependent, since the 
rank of the matrix formed from the components of these 
vectors does not exceed n. Therefore, if a = (z,,.  ., 2) 
is an arbitrary vector and the system of vectors (3) is line- 
arly independent (A = 0), then the system of vectors 
at, ..., a", a is linearly dependent, i.e. there exist 
numbers A,, -.., Any An+1, not all zero, such that 


dat +. eee + ae 1 he 4+ 7a = 0, 
where A,,+, + O (otherwise system (3) would be linearly 
dependent). Hence, 


a= > xta*, (5) 


where 2, = — (k=1, ..., m) are certain numbers. 


Let us express ‘sum (5) in terms of basis vectors i (see (2)): 


a= S) 2 (3 anit) = pa (Sd anek yi. 


h=1 


On the other hand, by (2), 


n 
a = >) zi! 
= 
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By virtue of linear independence of the system i’, ..., i” 


the coefficients of identical vectors i’ must be equal 


m= 2) Anil, (i= 1, e009 n) (6) 


Thus, if the components z, of the vector a according to 
the system i', ..., it are known, then the components 
x}, of this vector according to the system a’, ..., a” are 
found from (6) in a unique way, since the determinant 
of system (6) is A #0. 

We have proved that whatever a linearly independent 
system of vectors al, ..., a" may be, any vector a € R, can 
be decomposed according to this system, i.e. can be represent- 
ed in the form of sum (5), where zx, ..., Zn are certain 
numbers determined from (6) in a unique way. 

In this sense a system of vectors a1, . . ., a” is called the 
basis in R, meaning by this that any vector a € RA, can be 
represented in the form of linear combination (5) of these 
vectors in a unique way. We have proved that an arbitrary 
linearly independent system of n vectors in R, is a basis 
in R,. 

A linearly independent system of m vectors 


— 
ee (4, - + +> Bm) ay, m+is °° %) Bn), 


a = (Qmi1) © + +7 mm: 2m, mtir ++ es Amn)s 


where m <t nis not a basis in R,. Indeed, the rank of the 
matrix of components of these vectors is equal to m. We 
shall hold that the first m columns of this matrix form 
a determinant not equal to zero. Let us augment this 
matrix by adding to it a row at the bottom 


at — (0, ..., 0, 4, 0, ..., 0), 


where 4 occupies the (m -++ 1)st place. The augmented 
matrix has the rank (m+ 1), and, consequently, the 
system of vectors a’, ..., a”, a™+! is linearly indepen- 
dent. But then the vector a™+! cannot be a linear com- 
bination of the vectors belonging to the system a!,..., a” 
and, hence, this system is not a basis in Rp. | 
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Let us denote the matrix of vectors (af, ..., a") by 
4, B40 Qin 
A=l|ll -- ++ ee ees 
Qni ne Qnn 

The transition from the basis (i}, ..., i”) to basis 


(at, ..., a”) is realized with the aid of matrix A: 


>) a,,i° (k=1, ..., 7), (7) 


s=1 


i.e. the vector a" is expressed in terms of vectors i*° with 
the aid of the Ath row of matrix A. The transition from 
(at, ..., a") to (i’, ..., i”) is effected by means of in- 
verse matrix A- (see (9) from the preceding section) 


>) bya’ = (s=1,...,n), (8) 
l=1 


whose elements are computed by the formulas 6,; = 4 : 
where A,, is the cofactor of the element a;, in the deter- 
minant A (note that the element },, belonging to the sth 
row and /th column is expressed in terms of the cofactor 
Aj, of the element a,, belonging to the /th row and sth 
column). Note also that 


n 


Tr n 
a= >) rja’= >) 2,1 =>) ts 2D 31a = 3 (3 b,x.) a, 


l==4 s=1{ s=1 
whence 
n 
xji= >) bei, (9) 
s=j 
i.e. the transition from the coordinates (2, ..., Z,) to 
(Tj, -. +, In) is effected with the aid of the matrix 


(A-1)* = (A*)-1, 


It is seen from (9) that x; is expressed in terms of 2,,.. . 
., &, with the aid of the /th column of matrix A- 
or the /th row of matrix (A~‘)* conjugate to At, 
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Further, it is seen from (6) 


nT 
w= D1 apt; (s=1,..., 7”) 


that the transition from (7, ..., 2) to (a, ..., Z,) is 
accomplished with the aid of matrix A* conjugate to A, 
i.e. zs; is expressed in terms of x}, . . ., t, by means of the 
sth row of matrix A* or the sth column of matrix A. 

Remark. It was established in Sec. 14 that an arbitrary 
square matrix 


A=| ....-e. (10) 
Qny --- Ann 


determines the linear operator y = Ax (x € R,, y € A) 
defined by the formula 


Ya= 2 Ons (k=1,...,n). (14) 


But the converse statement is also true: whatever the 
linear operator y = Ax (x € R,, y € Ry) may be, it is 
determined by a certain matrix (10) so that the vector 
y = Ax is computed via the vector x by formulas (11). 

Indeed, let there be given an arbitrary linear operator 
y = Ax (x€R,, y € A,). Let us denote with its aid the 
images of the basis vectors i* in the following way: 

n 
A (i*) = (yg, Gags os +) Ons) = a; Gigi (SH 1, 4.0, nN) 


Same 


Then, by ‘virtue of linearity of A, any vector 


Te 
x= ai... 4+2,i"= >) 2,1 
s=1 


is mapped with the aid of A into the vector y defined by 
the equalities 


y= Ax=A(> 2,i*)= >) 2,A (i’) 
3s= s=1 


n n ee gee 
=>, Us pa Angi = 4 (2 Gn sts) ', 
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whence it follows that the kth component of y is deter- 
mined by (11). Thus, operator A generates matrix (10) 
whose columns are formed by the coordinates of the 
basis vectors. 

Example 1. Find the matrix of a linear operator (trans- 
formation) A consisting in rotation of the vectors of 


Fig. 33. 


plane R, emanating from the origin through an angle 
a(0O<a<_n/2) anticlockwise. 

Let us take the vectors i? = (1, 0), i? = (0, 1) for the 
basis. It is then obvious that (Fig. 33) 


A (i') = (cos a, sin a), 
A (i?) = (— sin a, cos @). 


Therefore, the matrix of our operator has the form 
cosa —sina 
A=|. : 
sin @ COS & 
Sec. 17. Orthogonal Bases in fi, 


Two nonzero vectors x, y € A, are said to have one and 
the same direction if there exists a positive number A such 
that x = Ay. 

We say that an arbitrary nonzero vector x € R, can be 
normalized by replacing it by the unit vector 


y= (lyi=4) 


having the same direction as the vector x. 
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The unit vector (having the norm (length) equal to 1) 
is called normal. 

Two vectors x and y in space R,, are termed orthogonal 
if their scalar product is equal to zero: (x, y) = 0. 

A system of vectors 


Me ot Gee ER (1) 


is called orthogonal if any two of its vectors are orthogo- 
nal. System of vectors (1) is said to be orthogonal and 
normal or, which is the same, orthonormalized if 


Bi Tiel ae, 
ER) = n= {o peep, pee): 


i.e. all the vectors of the system are normal and pairwise 
orthogonal. If system of vectors (4) is orthogonal and 
all of its vectors are not equal to zero, then, by normal- 
izing them, we shall, obviously, get an orthonormalized 
system. Orthonormalized system (1) is linearly independent. 
Indeed, let 


Ayx? + Ax*? +... + Ax” = 0, 


where 4,,..., Ay are numbers. Multiplying this equation 
by x* in a scalar way, we, obviously, get 
As (x’, x‘) =A, =O (= i,..., ¥). 


But then an orthonormalized system of m vectors in A, is 
a basis and, consequently, every vector a € A, can be 
represented in the form of a linear combination 


n 
a= >) A,x*. (2) 
k=1 


Multiplying this equality by x* in a scalar way, we get 
(a, XY) =A, (s =1, ..., 7) 


and, consequently, 


a= >) (a, x*)x*?, VaR. 
a4 7 
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The number (a, x") (| x” | = 4!) is, consequently, a pro- 
jection of the vector a on the direction of the vector x’*, 
Theorem 1. An orthonormalized system of vectors 


xo x4... xv Ov <n) 
can be supplemented to form an orthonormalized basis in R,. 
In other words, it is possible to indicate vectors x*t1, . . ., x” 
such that the system 

Msc spare Re ees (3) 


will be orthonormalized and, hence, is a basis in R,. 
Proof. Since v <i n, in R, there exists a vector a lin- 
early independent of x’, ..., x”. But then 


iV 


a= > (a, x") x? y, 


h=1 
where y = Q. The vector y is orthogonal to all the vectors 
x', ..., xv. Indeed, 


Vv 


(y, x‘)=(a— > (a, x") x*, x) =(a, x')—(a, x) =0 (4 


h=1 
(s=1,..., v). 
N ormalizing y, we get the vecto) 
1 
vt1— 7 v+ti] — 
x we Nas) 
and the system 
Xr eisueg. ky ee 
will be orthonormalized. If v + 1=n, then we have ob- 
tained a basisin A,. If not, then we proceed in the same 
way, until the (n — v)th step yields basis (3) in R,. 
The system of basis vectors of the axes in A,, 
ii = (1, 0, 0, ..., 0, 0), 
i? = (0, 1, 0, ..., 0, 0), 


im = (0, 0, 0, ..., 0, 4) 


* However, in R, for n > 3 the scalar product (a, x"), where 
| xk | = 1, ts defined as the projection of a on the direction of xh. 
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may serve as an example of an orthonormalized basis 
in f&,. 

An arbitrary vector a = (z,, .-.., Z,) € A, is decom- 
posed into basis vectors in the following way: 


fr 
k=1 


where z; = (a, i*) (k = 1, ..., n) is the projection of 
the vector a on the direction of the basis vector i*. 

Let there be given some definite orthonormalized sys- 
tem of nm vectors 


a= (aq, » Qin), 
ets ech es hk ; ! (6) 
a” = (An4; Gnn)s 
i.e. 
a*— >) agi (ke 4, ...,7). (7) 
s=1{ 


Here, the transition from the vectors (i', ..., i”) to 
(a, ..., a”) is realized with the aid of the matrix 


Ax . 8 ee we , (8) 
Any +++ Onn 


i.e. the vector a® is expressed in terms of i’, ..., i® with 
the aid of the Ath row of matrix A. 

The matrix A is orthogonal, i.e. it possesses the fol- 
lowing property: 


2, AnsQig—=5yr (kK, L=1, ..., mr). (9) 


Indeed, since in this case the system a!, ..., a” is ortho- 
normalized, we get 


n n 
Spi = (a®, a’) — (21 Qn si’, sy aii") 
= 


Te 


n nin 
= DS > Ap giy (i =>) > Ap 51,5 sr = ->, Zps@ls- (10) 
— 


ir=1 s=1r=1 


‘ 
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We see that, conversely, orthogonality of matrix (8) im- 
plies orthonormalizability of the system of vectors a',... 
..., a” defined by formulas (7). 

This shows that formulas (7), where || ays || are arbitrary 
orthogonal matrices, determine all possible orthonormalized 
bases in R,. 

Let us multiply equation (7) by i! in a scalar manner: 


(a*, i’) = abl 


Hence 
Tt Tr 
ié— >) (a*, i’) a*= >) apa. (11) 
h=1 h=1 
Thus, the transition from the basis (a!, . . ., a") to basis 
(it, ..., i") is realized with the aid of matrix A* conjugate 


to A. Since transformations (11) are inverse to transfor- 
mations (7) (see Sec. 15), we have simultaneously proved 
that the orthogonal matrix A possesses the following 
remarkable property: 


At = A*. 
It follows from (414): 


nr Ti 
Sai =(i", i) =(D) aga’, >) a,a") 
s=1 r=1 


n nT Tt 
= 2s pay AgnA71 (a®, a”) = py QgkAel- (12) 


= 


We defined the orthogonal matrix as such a matrix in 
which the rows (i.e. the vectors representing them) are 
normal, and different rows are orthogonal. From this 
definition, as is seen from (12), it follows automatically 
that in an orthogonal matrix the columns are normal as 
well, and different columns are orthogonal. 

The transition from (71, ..., t) to (tj, ..., Zn) is 
effected with the aid of the matrix (see (9) in the preceding 
section, x’ = Ax) 


(A*)-l = A** = A, 
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i.e. (regarding that a= >) zja’) 
[=4 


ad 2, Qislse (4 3) 


And the transition from , ++ +y En) tO (%, .~ -, Zp) 
is carried out via A* conjugate to A (see (6) in the pre- 
ceding section), 1.e. 


nr 
Ls = Zs. QlstI.- 
{= 


Note that the determinant of the arbitrary orthogonal 
Hares A (see (6)) is equal (by absolute value) to 
|] A tT = {aa || = 1. 
are follows from the fact that 


100... 0 

010... 0 
JA}? = anil - laul=1 3, Qniol=|........ =1. 

00 0... 14 


Here we hold that the element y,; of the product of the 
determinants is equal to the sum of products of the ele- 
ments of the kth row by the corresponding elements of 
the ith row (see Sec. 2, property (j)). 

Note also that the determinant formed of the compo- 


nents of the vectors constituting the basis i’, ..., i®* is 
equal to 1: 

1400... 0 

O40... 0 = 

0 0 0 4 

If an orthogonal basis a’, .. ., a" has the determinant 

| A | = 4 (see (6)), then it is ae to be oriented in He 
same way as the basis #, ..., iW. But if |A |= 


then it is said to be oriented in gn opposite way. ae 
definitions conform to the corresponding definitions given 
in Sec. 11 for the two-dimensional and three-dimension- 
al cases. 
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Remark. If in the matrix A (see (8)) the coordinates 
of the vector a‘ in the basis (i', . . ., i") were placed in 
the Ath column (A = 1, ..., n), then the transition from 
the coordinates of the vector x’ to the coordinates of 
the vector x would be realized by means of the rows 
of matrix A. And the passage from x to x’ in formula (43) 
would be achieved with the aid of matrix A*. © 


Sec. 18. Invariant Properties of a Scalar 
and a Vector Product 


Remark. In a three-dimensional space, let there be 
given two rectangular coordinate systems (2, Bo, L3) 
and (Ya, Yor I Y3) with the systems of basis vectors (i', i?, i?) 
and (j', j*, 7°), respectively. Let 


3 
= Syapei?  (k=1, 2, 3) (1) 
s=1 
(cf. (7) and (13), Sec. 17). Then the matrix 
A= || &yo ll 


is orthogonal and (cf. (13), Sec. 17) 


3 
= ast, (1=1, 2, 3). (2) 
s=1 


One and the same point (vector) of space has coordi- 
nates (components) (z,;, Z, Z3) in the first coordinate 
system, and (yj, Ys, Ys) in the second, the formulas for 
transforming the coordinates (i.e. for transition from 
(41, Zq, Zs) to (Y;, Ye, Ys3)) being Just the same as the 
formulas for transforming the system of basis vectors 
(7, i*, i?) to the system of basis vectors (j?, j?, j°). In 
both cases we apply one and the same orthogonal matrix 
(see (13) and (7) in the preceding section) 

A = || ear |}. 
In the first case the matrix A is applied to the system of 
numbers (x), 2, £3) in order to obtain the system of num- 
bers (¥;, Yo: ¥3), While in the second case the same matrix A 
is applied to the basis vectors (i!, i®, i®) to obtain the 
basis vectors (j', j*, j°). 


8—01496 
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Let us give a general definition of the vector used in 
tensor calculus. 

The vector in a three-dimensional Euclidean space is an 
entity that can be expressed in any rectangular coordinate 
system by a certain triple of numbers which are transformed 
in the same way (with the aid of the same matrix) as the 
triples of the basis vectors of the corresponding coordinate sys- 
tems. 

Similar terminology is also used for the case of n-dimen- 
sional spaces. 

Let a and b be vectors of a three-dimensional space 
defined in coordinate systems with the basis vectors 
(7, i°, i®) and (j’, 7°, 3°) as follows: 


a = dx, i* 4- dx, i? + ax,1° = dy,j' + dy,F° + ay}, 
b == by, i! + 65,0 + 65,13 = by,j! + by 7? + b,,). 
The following equality is valid: 


3 3 
ab= >; Ox, Ox = > Ay Dy 
s=1 r=1 


showing that the scalar product is invariant with respect 
to the transformations of rectangular coordinate systems. 

Indeed, since the systems of vectors (i', i*, i*) and 
(j7, j?, 7°) are orthonormalized, they are transformed by 
formulas (1), where || az; || is a certain orthogonal ma- 
trix. The components of the vector a (ax,, Gx, Qx,) are 
transformed to the components (dy,, dy,, @y3) with the 
aid of the same matrix (see (2)). Therefore 


3 3 3 3 

Tey y 
» Ay by, = >) (> ApsAx. 2: GryDx, ) 
r=1 T={ s=1 v=-1 


Ao 


3 3 3 
=> 5S) Ox by, ( S) Opry) = dx.b. = ab. (3) 
s=1 v=1 r=1 s 

Thus, we have proved the invariance of the scalar prod- 
uct ab using a computational method. However, from 
the other, geometrical, definition of the scalar product of 
two vectors (directed line-segments), by virtue of which 
ab = |b |-pr,a, it is directly seen that this number is 


il 
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an invariant, since this definition is not related with 
any coordinate system. 

As regards the vector product a X b, it is invariant 
relative to rectangular coordinate systems with common 
orientation. In systems having the basis vectors (i', i, 
i?) and (jt, j°, j°), the vector products are expressed in 
the following manner: 


- 


ip £6 
[a x b]i:, #2,i3=|42x, Ax, xs], 


[a x b] j:, Jj, A he — Ay, ay, ay, e 


By formulas (1) and (2), 


3 3 3 
Sat Sas Dat 
s= s=1 s=—1 
3 3 3 
[a x bly, pe, = py Aistx, bs Ag sAx es A3sAx., 
§=— s=— s= 


3 3 
2 h1sDx, 2 sx, >; Ags0x, 
$= 


s= s==1 
Cn ne Cae ke Tepe SP MC TES i ue 16 
=|@x, Ax, Axy|| Ho, Aeq Gog} = 1] ax, Ax, Ax, 
by, dy, Ox,||%sq Seq M33 De. De. Dye 


= ct [a X blir, iz, is, (4) 


where the sign + or — is taken depending on whether 
the determinant | a@;, | equals +4 or —1, or, which is 
the same, on whether the transformation in question 
changes orientation. 

In this case the following rule has been used for multi- 
plying the determinants: the element y;, of the matrix 
of the product || ;, || is determined as the product of 
the ith row of the first determinant by the kth row of the 
second (see Sec. 2, property (j)). 


gx 
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Thus, we have proved by a computational method 
that the vector product of two vectors is invariant with 
respect to the transformations of rectangular coordinate 
systems which do not change their orientation. 

Transformations (3) and (4) are of interest as they are 
generalized for the case when the role of the vector a 

0 0 a 


is played by the symbolic vector y = ( 
which is of great importance in mathematical analysis. 


Sec. 19. Transformation of Rectangular 
Coordinates in a Plane 


Let there be given a plane AR, in which a rectangular 
coordinate system (z,, z,) is specified. Let then 


ii = (1,0), 2% = (0, 4) 


be basis vectors of the axes z,, r,. The basis vectors i’, 
i? form an orthonormalized basis in R,. 

An arbitrary unit (normal) vector b! can be written 
in the following way: 


b= (cosa, sine) OXa< 2n). 


The unit vector orthogonal (perpendicular) to b! will 
be denoted by b?. It may only correspond either to the 


aU It s 
angle a+, or @—-~>. Since 


2 
JT ‘ rs It 
cos(a+ +] = —sina, sin («++ )=cosa, 
qu . : af 
cos(a—-) =sina, sin(a—+ ] = —cosa, 


all possible orthonormalized systems b', b’? in R, are 
defined either by the equalities (Fig. 34) 


b!'—j'cosa+i?’sina, 


< 2m), : 
Se ee at a (1) 


corresponding to the rotation of the axes about the origin 
through an angle a with the orientation retained, or by 
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Fig. 35. 


the equalities (Fig. 35) 
b!=itfcosa +i’ sin a, 


b?= i! sing +i? cosa, 


(1") 


corresponding to the rotation of the axes about the origin 
through an angle « with the orientation changed. 
Both transformations are united in the following for- 
mula: 
bt = Gyi' + agai’, (4) 
b? —_— Oe i! + Qool”, 
where the transformation matrix 


Oy, Ayo 


A= 


Qoq Ag 
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is orthogonal (the sum of the squared elements of any of 
its rows or columns is equal to 1, while the scalar product 
of two different rows or columns is zero). 

Any defined orthogonal transformation (4) is, in fact, 
one of transformations (1’), (1”) for a certain a. 

By virtue of the orthogonality of matrix (2), it fol- 
lows from (1) that 


it = a,b! + @,,b’, 
i? = a ob! + aepb?, 


(3) 


and, thus, we have obtained a transformation inverse to 
transformation (1) with the matrix 


yy Ay 


— A*, 


Ay Ago 


which is conjugate to A. 

Let there be given in a plane an arbitrary vector (point) 
a having the coordinates (z,, z,) and (x,, z,) in the old 
and new coordinate systems, respectively. Then 


a = Zi! + 2z,i*7 = xb’ + z;b’. (4) 
By formulas (3) and (4) 
tibt + xb’ = xy (ayyb* + aayb*) + Lg (&2b* 4+- Goeb”) 
== (4,7, + Gyo%) bt + (Goy2y + Agate) b*. 


Therefore, equating the components of identical basis 
vectors b', b?, we get 


Ly = Uy Ty + AgoL., 
L, = Ag Xy + Aggy. 
And, by formulas (1) and (4), 


zi! a Lyi? ms zy (i as 191") oo x, (to3i" as Qo oi”) 


(9) 


= (Ay12, + Og1%5) it + (aor, + Oyo) i’, 
whence, equating the components of i! and i*, we get the 
formulas inverse to (5): 

TX 442, + OZ, 
Dy = Oyo%, + gel,» 


(6) 
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If, in addition to transformation (6), the origin of the 
axes x}, z, is transferred to the point O’ having the coor- 
dinates x, = 2}, X, = x}, then formulas (6) will, obvious- 
ly, be complicated as follows 


__ 2,0 f t 
LyX, -+ AqyZ, + bo4Lo, ! 


0 / ' 
Ly = Ly -+ AyoL, + haot,. 


(7) 


Thus, an arbitrary transformation of the rectangular 
coordinates (z,, z,) to rectangular coordinates (z,, 7z,), 
with the origin of the coordinate system (z,, zx.) trans- 
ferred to the point O° = (z{, x2), is expressed by (7), 
where the matrix 
Ay, Hog 


Ain Hog 
is orthogonal. 
The corresponding transformation retaining the orien- 
tation of the given coordinate system has the form 
t,=2)+2;,c0osa—zx, sina, 


(7") 


and the transformation, changing the orientation, has 
the form 


0 fs t 
Log =, +2, SINe+7F, COSA 


ty= 2} +x cosa-+x, sing, 7") 
tg =2,-+2, sin a—-z, cosa 

(the matrices of the coefficients of x, and x, in (7') and 
(7") transpose (1’) and (1”), respectively). 


Sec. 20. Linear Subspaces in %,, 


The set L in R, (LC R,) is called a linear subspace 
of the space R,,, or, in brief, a subspace in Ff, if from the 
fact that two certain vectors x and y belong to Z (x, y € L£) 
it follows automatically that the vector ox + By also 
belongs to ZL (ax + By € L), where a, B are numbers. 
The subspace L is called m-dimensional if there is a linearly 
independent system at, ..., a” in it consisting of m 
vectors, and there is no system consisting of m-+ 14 
linearly independent vectors. 
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Here, if a is an arbitrary vectorin LZ (a € L), then the 


system a', ..., a™, a is linearly dependent, i.e. there 
exists a nontrivial system of numbers A,, ..., Am: 
Am+ 1 such that 

Aart... + Apa™ + Amaya = 0. (1) 


Here Am+, =40 otherwise it would be 
Matt... t+ Aya™ = 0 


and in consequence of the linear independence of the 
system al, ..., a™ it would be A, =... =A, =O), 
the entire system Ny, - + +) Am, Am+, would also be trivial. 
Then equation (1) can he solved with respect to a 


= watt... + pma™ (Ws = —Ag/Ami), (2) 


i.e. can be represented in the form of a linear combination 
of the vectorsa?, ..., a™. On the other hand, the linear 
combination of form (2) belongs to L, since LZ is a sub- 
space. In this sense, the system at, ..., a™ is said to be 
a basis in L. Obviously, any other linearly independent 
system of vectors b!, ..., b”™ belonging to Z is a basis 
in L. 

Decomposing the vectors bi into the vectors a‘, ... 

., a”, we get 


b? = Py b,ca® (k==1, ..., m). 


Reasoning in the same way as in Sec. 16 for R, (where 
now i’ and a® have to be replaced by a® and b’, respective- 
ly), we can obtain that the system bt, b”™ is linearly 
independent when and only when the determinant Dx | SE 
+ 0, and that any independent system consisting of 
i<i m vectors cannot be a basis in L. 

The space R, may be regarded as a subspace R, having n 
dimensions. 

The set consisting of one zero vector 0 is a linear sub- 
space (20 + 60 = 0). It is said to have O dimensions. 
The vector 0 does not form a linearly independent system, 
since from the equality 40 = 0, where A is a number, 
does not necessarily follow that A is zero. 
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If the vector x® =~ 0, then the set of vectors of the form 
Xx, where A is an arbitrary number, is a one-dimensional 
subspace. The vector x® may be taken as a basis in it. 

Let ZL be a linear subspace in A,. The vector v € R, 
will be said to be orthogonal to L if it is orthogonal to 
any vector u € L. Let us denote by L’ the set of all vec- 
tors orthogonal to L. L’ is a subspace. Indeed, let v, 
v EL’, ive. 

(v, uy =0, Vue Z; 
(vi, u =0, Vue Z. 


Then for any numbers a, fp 
(av + Pv’, yu =a(v, yb + P(v',u =90, Vuel, 


i.e. aV + Pv’ EL’. 

The subspace L’ < R,, is termed orthogonal to the given 
subspace L — R,, if L’ is the set of all vectors each of 
which is orthogonal to JL. 

Proved below is a theorem explaining the structure of 
an arbitrary subspace LZ < R, and the subspace L’ Cc R, 
orthogonal to it. In particular, it follows from this theo- 
rem that if L’ is orthogonal to L, then the converse is also 
true: L is orthogonal to L’. 

Theorem 1. Let L be a linear subspace different from R,, 
and from a zero subspace. Then: 

(a) there exists a whole number m satisfying the inequalities 


1<m<n, (3) 

and an orthonormalized basis 
al... a™ (4) 
in L; if this basis is extended in any way to the basis in R,, 
Bae a ES cs, ge (5) 


then the linear subspace L' with the basis 
As ar ety (6) 
possesses the following properties: 


(b) L’ is a subspace orthogonal to L; 
(c) L is a subspace orthogonal to L’; 


~ 


122 Fundamentals of Linear Algebra and Analytical Geomeiry 


(d) any vector a€ R, can be represented in the form of 
a sum 


a=u-+ Vv, 


whereu€ L, vy € L’ and this can be done in a unique way. 
Proof. By the hypothesis, Z is different from a zero 

subspace, consequently, in ZL there exists a vector x 

different from 0. Normalizing x, we get a normal vector 


1 
—_ 
Be eee 


Let us denote by a® any (belonging to Z) normal vector 
orthogonal to a! (| a? | = 1, (a?, a!) = 0), provided that 
such a vector exists. Further, let us denote by a® a normal 
vector, belonging to L, that is orthogonal to a‘ and a? 
(ja? | = 1, (a3, at) = (a®, a?) = 0), provided that such 
a vector exists. This process will be completed at a cer- 
tain mth step, where m satisfies inequalities (3), i.e. we 
can find an orthonormalized system of vectors (4) belong- 
iag to LZ, but the latter will not contain a unit vector 


orthogonal to the vectors a’, ..., a”. Indeed, m > 1, 
since, automatically, at ¢ L. On the other hand, m can- 
not be equal to n. Otherwise, the vectors a!, ..., a” 


would belong to Z and, together with them, all linear 


combinations >'\A,a* would belong to the subspace L. 


h=1 

And this would result in that Z coincides with A,, but L 
is different from A,. The obtained orthonormalized sys- 
tem al, ..., a” is a basis in L. Indeed, together with 
the vectors al, ..., a™, all their linear combinations 


m™m 

. ts 
>\A,a" belong to L. But there are no other vectors in L, 
k=1 

since if we assume that a certain vector a € Z is not such 
a linear combination, then a could be written in the form 
of a sum 


a == x (a, a’) a®-+y, (7) 


a 
| 
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where y 0. Since the vectors a and a* belong to the 
subspace ZL, we had to conclude that the vector 
m 
y=a— )) (a, a*)a* 


also belongs to LZ. But the vector y is orthogonal to all a® 


(s = 1, ..., m) (see (4) in Sec. 17). The normalized 
vector 
1 
bay) _ 
would also belong to Z and would be orthogonal to all 
a’ (k = 1, ..., m). But this is impossible due to the 


maximal property of the number m. This proves state- 

ment (a) of the theorem under consideration. 
Orthonormalized system (4) is supplemented to ortho- 

normalized basis (5) on the basis of Theorem 1 from Sec. 17. 


We denote by L’ the subspace of all linear combinations 
nr 
v = >jp,a" from the vectors of system (6). Every such 
hk=m+1 
vector is, obviously, orthogonal to any vector u€ ZL 


m 
that is represented in the form of a sumu = DjAga*. 


h=1 
On the other hand, if a€ R, is an arbitrary vector orthog- 
onal to all vectorsu€é JL, to a!, ..., a™ in particular, 


then its decomposition according to basis (5) has the form 


h4s 


Ti 
a= >! (a, a") a? = 
k=1 k=m-+i 


(a, a®) a®, 


i.e. a€L’. We have proved statement (b) of our 
theorem. 


mM 
Further, any vector u= >) A,a"€L is orthogonal to 
KA 


Tr 
all vectors v= )>; p,a°€L’ and, if it is known that 
k=m+1 
Tr 


a certain vector a= ee (a, a®).a® 


k=1 
vectors from L’, to a™+1!, ..., a” in particular, then 


is orthogonal to all 
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m 
a= >) (a, a*)a®, that isa€Z. And so, we have proved 
k=1 


statement (Cc). 

Finally, if a€ AR, is an arbitrary vector, then it can 
be represented in the form of a sum in a unique 
manner: 


nr 
a= > (a, a*) a*=u-+v, 
k= 


m Tt 
u= >) (a,a*ja"E€L, v= >) (a,a*a® EL’. 
h=1 h=m-+1 

Thus, Theorem 1 has been completely proved. 

Theorem 2. Let L be a subspace in R, having m dimen- 
sions. Then the subspace L' C R,, orthogonal to L has 
n—m dimensions, and L is, in its turn, a subspace ortho- 
gonal to L’. 

Proof. If £ is different both from A, and zero subspace, 
then this theorem is, obviously, containedin Theorem 1. 
Rejecting from the formulation of Theorem 1 everything 
concerning bases (4), (5) and (6), we obtain Theorem 2. 

Let Z be a zero subspace. Since any vector a € R,, is 
orthogonal to 0, Z' = AR, and the dimension of. R, is 
equal to n — 0 =n. Conversely, the vector 0 is orthog- 
onal to all vectors, a¢é R, = L’. There are no other 
vectors orthogonal to all vectors R,, since any vector 
different from 0 is no longer orthogonal to itself. Thus, 
we have proved that ZL is orthogonal to L’. 

If £ = R,, we resort to similar arguments. 

Corollary 1. Let there be given a system of vectors 


ee ae cae (9) 
and let L' be a subspace of vectors v each of which is or- 
thogonal to the vectors of this system: 

(vim) 0) Abe 4, bee mi). 
Further, let there be given a vector a orthogonal to all indi- 


cated vectors v, i.e. orthogonal to the subspace L’. Then 
a is a certain linear combination of the vectors of the given 
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system (9) 
m 
a = = Ax" 
k=1 


Proof. Consider the subspace L consisting of all pos- 
sible linear combinations of the vectors of system (9), 
i.e. any vector u € Z is a certain linear combination 


™m 
u= >) A,x®. 
h=1 


In this case we shall also say that the subspace L i 
stretched on the vectors of system (8). 

Since any vector v € L’ is orthogonal to the vectors of 
system (9), it is, obviously, orthogonal] to any vector 
u € ZL. This shows that subspace L’ is orthogonal to 
subspace L. But then, by Theorem 2, L is also orthogonal 
to L’, i.e. £ consists of all vectors u orthogonal to L’. 
By hypothesis, a is one of such vectors u, consequently, 
a is a certain linear combination of the vectors of sys- 
tem (9). 


Sec. 21. Fredholm-type Theorems 


This section deals with the theory of linear equations 
which is parallel to the theory discussed in Sec. 4. 

This is a determinant-free theory. The determinant of a 
system of equations does not enter into its formulation 
explicitly. Its advantage consists in that it served as the 
foundation and analogy for many generalizations in 
mathematical analysis. The first important generaliza- 
tions of this kind belong to E. Fredholm*. 

We consider the linear operator A once again (see 
Sec. 15): 

y=Ax (x€R,). (1) 


It associates every vector x € A, with a vector y € R, 
with the aid of the equations 


nr 
Yi = di ists (as ere OF (2) 


* Fredholm, Erik Ivar (1866-1927). Swedish analyst and phy- 
sicist. 
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Here 


A=|| a: [| = ses eas eo See (3) 


Ani ano eo @e ann 


is a given quadratic matrix. Operator A is associated with 
the conjugate operator 


y= A*x (x€R,), (1*) 
defined by the matrix 


y= | | (3*) 
Gin Gan «++ nn 


which is conjugate to (2). With the aid of the components 
of the vectors x, y, it is written in the form: 


Yji= ea Qi j;X1 = A ae nh), (2*) 


i.e. the component y; is expressed in terms of the coor- 
dinates of the vector x with the aid of the jth row of matrix 
A* or the jth column of matrix A. 

The following equality is valid 


(Ax, z) = (x, A*z), Vx, z€ Ry, (4) 
which is true for all x, z € R,. Indeed, 


Tr 


(Ax, 2) = 2 Yi25= > (> Aisks) 2 2% 


s=1 
33 01921) Xs (x, A*z). 
ee i=1 


Equation (4) is characteristic for a conjugate operator, 
since, if for some linear operator B the equality 


(Ax, Z) a (x, Bz), Vx, Z E Rns (9) 
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is fulfilled, then it is necessary that B = A*. Indeed 
(B = |I 4: Il). 


(Ax, Z) = va 


nr th 

AY NI 

> 21 Vist s2in 
§ 


(x, Bz)= >) s b542;l3 = s! s b,;152;- 


s=13=—1 i=1is=1 


It follows from (5) that 


> > Azets2i—= >) >) b5i%s2;, Wx, ER, (6) 
i=1s= i=1 s=1 

whence a;, = b,; (i, s = 1, ..., nm), which can be veri- 
fied if we set in (6): x = (0, ..., 0, 1, 0, ..., QO) and 
z=(0,..., 0, 1, 0, ..., 0), where 4 occupies the 
sth place in x and the ith place in z. 

Hence, operator A* conjugate to the linear operator A 
can also be defined as such a linear operator for which equali- 
ty (4) is fulfilled. 

Equalities (1) and (4*) may be considered as equations— 
given a vector y €R,, and we find x€R,, for which 
either equality (1), or (1*) is fulfilled. 

The corresponding homogeneous equations have the 
form 


Ax = 0 (15) 
or 
2 ists =0 (eal 5 23:28) (25) 
and 
A*z = 0 (1*) 
or 
pa ayjz=0 (j=1,...,n). (2*) 


Let us denote by Z the image of the space R,, with the aid of 
operator A: 
L=A (A), 


and by L’ the subspace of all vectorsz satisfying homoge- 
neous conjugate equation (1*). 
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We have called L’ a subspace, since, together with z, 
z', it also contains az + fiz’, where a and £B are numbers: 


A* (az + $2’) = aA*z + PA*2’ = 0. 


LE is also a subspace, since, if y, y’ € L, then there exist 
vectors x, x’ € R, such that y = Ax, y’ = Ax’, and, 
consequently, 


ay + By’ = aAx + BAx’ = A (ax + Bx’), 


ie. ay + By’ EL. 

The following lemma is valid (see Sec. 20, Theorem 2). 

Lemma 1. Subspaces L and L’' are mutually orthogonal, 
i.e. L’ is the set of all vectors z% each of which is orthogonal 
to L, and L, in its turn, is the set of all vectors y each of 
which is orthogonal to L’. If L has k dimensions, then L’' 
has n — k dimensions. 

Proof. Consider the equality 


(Ax, 2) = (x, A*z), (7) 


which is true for all x, z €R,. Let z be a vector orthog- 
onal to JZ, then, for it, the left-hand member of (7) is 
equal to zero for all x € A,, but then the right-hand mem- 
ber is also equal to zero for all x € R,, for x = A*z, 
in particular: 


(A*z, A*z) = 0. 


Consequently, A*z = 0. We have proved that if the vector z 
is orthogonal to L, then it satisfies the equation A*z = 0 
(i.e. 2 € L’). 

Conversely, let the vector z satisfy the equation A*z = 
= (). For such a z the right-hand member of (7) equals 
zero for any x, but then the left-hand member is also 
equal to zero, i.e. z is orthogonal to all vectors of the 
form Ax, i.e. to all vectors y € L. In other words, z is 
orthogonal to L. 

Thus, we have proved that U’ is the set of all vectors z 
orthogonal to subspace Z. But then, by Theorem 2 of 
Sec. 20, and conversely, isthe set of all vectors y ortho- 
gonal to LZ’, and the sum of the dimensions of LZ and L’ 
is equal to n. The lemma has been proved. 

The following theorem holds true. 
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Theorem 1. Jn order for the equation 
y = Ax (4°) 


to have a solution for a given vector y € Ry, it is necessary 
and sufficient that the vector y be orthogonal to all vectors z 
satisfying the homogeneous conjugate equation 


A*z = 0 (1%) 


The solution x of equation (1), provided that it exists, can 
be written in the form of a sum 


x=x°+u, 


where x° is a certain particular solution of equation (A), 
and u is an arbitrary solution of the homogeneous equation 


Au = 0 (1) 


Any indicated sum is a solution of (1'). 

Proof. By virtue of Lemma 1, if 2 = A (R,) and L’ 
is the set of all z satisfying the equation A*z = 0, then L 
and JL’ are mutually orthogonal subspaces: But then, 
if for y there exists a solution of equation (1), y € Z and 
it is necessary that all z € L’ be orthogonal to y. But if 
the vector y is orthogonal to all z € LZ’, then y € £, ive. 
there exists an x for which y = Ax. 

Let now there exist a solution of equation (1’) for 
vector y. We denote it by x’: 


y = Ax’. 
Then, obviously, the sum x® + u, where Au = 0 is also 
a solution of equation (1°): 

A (x® + u) = Ax®+ du =y+O0=y. 
Conversely, if x is an arbitrary solution of equation (1’), 
and x® is a definite particular solution, then 

y=Ax, y = Ax’, 
and, consequently, 
0 = Ax — Ax®° =A (x — x’) = Au, 
where u = x — x’, ie. x = x® + u, where u satisfies 
the equation Au = 0. 
9-01496 
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Remark. Taking the space R, as an example, let us 
clarify the connection of Theorem 4 with the Kronecker- 
Capelli theory. Let the vector y = (y,, y.) be orthogonal 


to all the solutions of the system 
04424 + Aaq2. = 0, 
A424 + doo%_ = 0. 


(8) 


Let us show that the ranks of matrix A and the augment- 
ed matrix 


B= 


Qyy Aya A 
Goi Aon Yo 


are then equal to each other. If rank A = 2, then, obvious- 
ly, rank B = 2. Let rank A = 1. It is always true that 
rank B > rank A = 1. Therefore, we have to prove that 


(en Qing Ys 
G21 Ye G22 Ye 


Indeed, since y is orthogonal to nontrivial solutions of 
system (8), we have 4,2, + Yo2, = 0. Therefore, holding 
that z, 0, 


Ay = G42 — Aaa 


=(), A,= = (0). 


Yoto _ Ye 
= Aye + Gay a oy (4424 + g429) = 0, 
Ag = AyeYo— Goo 
Yo2o _ Ya 
= QyoYa 4 Boe Zz oy (Q4o24-+ Ao229) =(0. 


Hence it follows that rank B = rank A = 1. 

Conversely, let the vector y = (y,, y.) be such that 
rank B = rank A, then (4) has a certain solution (x, 2). 
We are going to prove that y is orthogonal to the solu- 
tions z = (2,, 2.) of system (8). Indeed, 


Yr2Z4 + Yo%eq = (Gyy%, + AyoLe) 2 + (Goy% + Ay0%o) 2g 
= (G42 + GZ) TH + (Ay921 + Gq92q) Lo = O-X, + O-z, 


= 0, 
Theorem 2. The homogeneous equations 


Ax = 0 (4) 
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and 
A*x = 0 (1°) 
have an equal number of linearly independent solutions. 

In particular, if one of these equations has only a trivial 
solution Q, t.e. a zero of independent solutions, then it is 
true for the other equation as well. 

Remark. In the latter case equation (1) has a unique 
solution. 

Proof. The matrices A and A* have one and the same 
rank which will be denoted by &. They also have one and 
the same determinant A. 

If k =n, then A + 0 and equations (19) and (1%) have 
only trivial solutions 0. In this case, according to Theo- 
rem 1, equation (1) has a unique solution for any y € Rj. 

Let now 1<k <n. On renumbering the equations 
and components, the determinant 


Ay, --- Giz 
it a nd ~ 0. 
Qh, .-- App 
Let us now write the first k equations of (4°) in the form 
AyyXy os TF AgnLp= — Ay, neyTRta— ---— Anny 
bad oe wal wl ee eee ee ee (9) 
Only «+. + Onnly = —Gp, ptilaii— -+- —Agntn- 
Given below is a table of n — k vectors: 
1 
Rt (Gv ope gh ae OD), 
2 2 
x? = (24, a ee ge RS 0, 1, 0, ies O)s (10) 
xr-h — (gn-h, pee O; , 0, 4) 


In order to obtain the first vector, we substitute into 
system (9) 
Tray = 1, Tepg = 0, ..., & =O 

and solve it with respect to z,, ..., £,. We shall denote 
the unique solutions obtained here by z}, ..., 2%. To get 
the second vector, we substitute into (9) 

Cry = 0, Lrg = 1, tr+3 = Q, ee ayn Ly = 0 
9s 
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and find the numbers 23, ..., xj, and so on. Vectors (10) 
possess the following properties: 

(1) System of vectors (10) is linearly independent, since 
the rank of the matrix of these vectors is equal to the 
number of these vectors py = n — k. 

(2) Every vector of system (10) is a solution of (any!) 
equations (15) or Ax = 0. 

(3) All possible solutions of the equation Ax = 0 have 
the form 

\4x! ye. ae +- hnsgx 
where Aj, ..., An-z are arbitrary numbers. 

These three assertions are usually replaced by the words: 
equation (1,) has n — & linearly independent solutions. 

Reasoning in such a way and taking into consideration 
that rank A = rank A*, we prove that the equation 
A*x = 0 also has nm — k linearly independent solutions. 
The theorem has been proved. 

Theorem 3. Jf one of homogeneous equations (19) or 
(13) has k linearly independent solutions, then the other 
also has k linearly independent solutions; and the images 
L =A (R,) and L* = A* (&,) of space Ry obtained with 
the aid of operators A and A* are subspaces of n — k di- 
mensions. 

Proof. The first statement of this theorem concerning 
the equality of the numbers of linearly independent solu- 
tions of homogeneous equations (1) and (1*) is Theorem 2, 
while the second is Lemma 1, by virtue of which the di- 
mension of subspace Z is equal to n — &, where & is the 
dimension of the subspace L’ of the vectors z satisfying 
the equation A*z = 0. Analogously, the dimension of L* 
is equal to n — k, where k is the number of dimensions 
of the subspace of the vectors u satisfying the equation 
Au = 0. 
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Quadratic Form 


A linear operator 
n 


Ur= 2 Apt, (k=1,..., 7), (1) 
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where we considered a,; to be real numbers, or, briefly, 
y=Ax (KER, yER,) (2) 


is called self-adjoint if it is equal to its conjugate operator 
(A = A*), ie. if 


Ax = A*x, VxER,, (3) 
or in other words, if matrix A is symmetric: 
Apt = Ain (k, = t ees n) (4) 


(see (3) and (3*) in the preceding section). 
For a self-adjoint operator the following characteristic 
equality holds true: 


(x, Az) = (Ax, z), Vx, zER, 


(see (4) in the preceding section). Obviously, 
nr Tr nm Tm 
(x, Ax) = > Lp pay Arltl = 2 2 AniLpL1 (Apr = ap). 


k=1 
(4°) 


The expression on the right in (4’) is called the quadratic 
form of the nth order. This is a continuous function of 
the vector x or, which is the same, of the variables z,, ... 


ot, deo: 
We shall consider this function on the set S of values of 
x having the unit norm (|x | = 1). The set S is a sphere 
in R, of radius 1 with centre at the point 0. S isa bounded 
set; belies, it is closed*: if the points of the sequence 
{xY}(v = 1, 2,...) belong to S (i.e. [x¥ |] =1, v = 1, 
. and this sequence tends to a certain point 
x? € R,, “(xy +x’, v > oo), then, inevitably, x° € S, i.e. 
| x° 4, since 14 — |x°{ |= xv] — ix° || <|x” — 
— x° | +0, whence | x° | = 
Let us find the maximum of quadratic form (4') on the 
sphere S. Since form (4’) is a continuous function on a 
closed bounded set, its maximum on S is achieved for 
a certain unit vector x! (|x! | = 1). Let us denote this 


* See our book “Differential and Integral Calculus” (the second 
part of our series “Higher Mathematics”), Mir Publishers, Moscow, 
1982, Sec. 8,12. 
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maximum by A,: 
A, = (Ax!, x") > (Ax, x), Vx: [x] = 1. (5) 
We then introduce a subspace L’ orthogonal to the 
vector x!, i.e. the set of all vectors v each of which is 


orthogonal to x!', and take an arbitrary unit vector 
v? (jv? | =1) in LZ’. The vector 


cos a-x! + sin a-v? 
depends on a@ and has a unit norm: 
cos @-x! + sin a-v?® | 
= (cos a-x! + sin a-v°, cos ax! 4+ sin a-v°)’/2 
= (cos? a -++ sin? a)'/2 = 1 
For a = 0 this vector turns into x'. But then the func- 
tion 
» (2) = (A (cos a-x? + sin a-v°), cos a-x! + sina-v®) 
reaches its maximum at the point a = 0 (p (0) = (Ax', 
x')) and, by virtue of a necessary condition for an extre- 
mum, 
ip’ (O) = 0. 
Let us compute this derivative. We have 
wy (a) = cos? a (Ax!, x") + sin 2a (Ax', v°) 
+ sin? a (Av’, v?). 
Consequently, 
pp (a) = —sin 2a (Ax', x’) + 2 cos 2a (Ax', v°) 


+ sin 2a (Av®, v°) 
and 
wp (0) = 2 (Ax?, v®) = 0. 

We have obtained that the vector Ax! is orthogona 
to all unit vectors v® € L’ and, consequently, to any 
vectors v € LZ’. But then Ax! differs from x! only by a 
factor (see Corollary 1 at the end of Sec. 20), i.e. 


Ax! = ix!, 


where A is some number. 
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From the first equality of (5), taking into account that 
|x? | =1, it follows: 


A, = (Ax?, x4) = 2. 


Thus, we have proved that the maximum of quadratic 
form (4’) on the unit sphere | x | = 4 is reached at a cer- 
tain point x!, 


max (Ax, x) = (Ax!, x!) = Ay. 
jxj==4 


And 
Ax! = Ayx!, |x! | = 1. 


We see that a nontrivial (nonzero) vector x! is mapped 
with the aid of operator A into A,x!, i.e. into a vector 
collinear with x!. 

Such vector is called the eigenvector of operator A, 
and the number A, is known as the eigenvalue belonging to 
this vector. 

We shall now consider the operator A on the subspace R? 
defined as the set of vectors x (€ R,) orthogonal to the 
vector x! (above it was denoted by Z’). Riis an (nm — 1)- 
dimensional subspace with orthonormal bases consist- 
ing of nm —1 vectors. Our aim consists in finding one 
such basis, as we shall see, naturally related with ope- 
rator A. 

It is important to underline that the image A (R‘) 
of the subspace R! belongs, with the aid of operator A, 
to R}, since, if (x, x!) = 0, then 


(Ax, x!) = (x, Ax!) = (x, A x!) = 1, &, x’) = 0, 


ie. Ax € Ri, 
Self-adjointness of the operator A in R" is retained 
in a trivial way, since the equality 


(Ax, y) = (x, Ay) 


being true for all x, y € Ry, is also true for all x, y € R’. 

And so, we now consider a self-adjoint linear operator 
A in a linear subspace R‘ of dimension n — 1. Treating 
it in the above way, we can reveal in A‘ a unit vector x? 
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such that 
max (Ax, x)=(Ax’, x2) =A,<A,. 
{x|=1 [x2|—=4 
(x, xl)=0 (x2, x1)=0 


The point is that a unit sphere S! in R! is defined, 
obviously, as the set of unit vectors x orthogonal to x!. 
In this case we may write 


Ax® = hax’. 


We have found a second eigenvector of the operator 
A—the vector x* and the eigenvalue i,, belonging to it, 
which does not exceed i, (a diminished domain of consid- 
eration may result only in a reduced maximum). And 
here (x!, x’) = 0. 

Proceeding in a similar way, we can introduce a sub- 
space #? of dimension n — 2, orthogonal to the vectors x! 
and x’, show that the operator A maps R? into R*, and 
define a third unit vector x3, orthogonal to x! and x’, 
such that 

max (Ax, x) = (Ax, x°) = A, 


}xJ=1 
(x, x1)=0, (x, x2)=0 


and 
Ax? <= hex” (A3 <= No <= hy). 


Continuing this process by induction till the mth vector 
x”, we shall obtain an orthonormal system of vectors 


My Se ee (6) 
and a system of real numbers 
M4, dos ee ey Mns (7) 


possessing the following properties: 
Ax*—A,x® (k=:1,..., 7), 
Lah ohe | 6) 
Remark 1. If a self-adjoint operator A is considered in 
a complex space A, it is possible to prove that the eigen- 


values of this operator are all the same real numbers. 
Indeed, let x be the eigenvector of the self-adjoint ope- 
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trator A and A its corresponding eigenvalue, i.e. Ax = Ax, 
x = 0. Then, since A is a self-adjoint operator, (Ax, x) = 
= (x, Ax) or (Ax, x) = (x, Ax). 

Using the properties of the scalar product in the com- 
plex space R, (see Sec. 6), we have: A (x, x) = A (x, x). 
Since (x, x) +0, we get 1 =A, i.e. J is a real number. 

Besides, the eigenvectors of the self-adjoint operator A 
associated with different eigenvalues: 


Ax, =X, AXg = Nokes (Ag ~ My)3 

(AX), Xz) = (AX, Xo) = Ay (Ki, Xa); 

(xy, Ax) = (X1, AgXe) = Ae (Ki, Xa)5 
O = (Ay — Ae) (Xi, Xe)3 


are orthogonal. Since A, A,, then (x,, x.) = 0, i.e. 
the eigenvectors x, and x, are orthogonal. 

We have obtained a complete system of eigenvectors of 
operator A associated with the corresponding eigenvalues. 
Since orthonormal system (6) belongs to AR, and consists 
of n vectors, it is a basis in R, (see Sec. 17). Therefore an 
arbitrary vector x € R, can be decomposed according to 
this system: 


ss es (x, x") x’, (9) 


Then our self-adjoint operator A can be written in the 
following way: 


Ax=A (3) (x, x") xt) = 3} (x, x) Ax! 


= Di da(x, x*)x* (40) 


Thus, we have proved the following theorem: 

Theorem 1. Zo a self-adjoint operator A in space Ry 
there corresponds an orthogonal system of vectors x1, ..., x” 
Uae of R,,) associated with a system of real numbers h,, .. 

- +; A, Such that Ax is represented in the form of 
sum (10) for any x € Ry. 
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Quadratic form (4’) is then correspondingly written 
as follows: 


nr 


(x, (x, Ax) = (3 oe, x Di he (x, x*) x°) 


$= 


=. (x, x2, (4”) 


In practice, we frequently proceed from a_ certain 
quadratic form 


nr nr 
> DY) Garey: (yr =a). (4) 
ae ee 


In order to apply to it the obtained results, it is possible 
to determine in connection with it the linear operator 


y = Ax, 
defined by the equalities 


n 
yi= 2s aur; (a1; bdtgte)e 
I= 


By virtue of the condition a@,; = aj, this is a self-adjoint 
operator, therefore Theorem 4 is applicable to it. In terms 
of the quadratic form, Theorem 1 can be reformulated 
as follows. 

Theorem 2. Let there be given a quadratic form (4’) 
in a certain n-dimensional coordinate system (x1, ..., Ln) 


of space R, with the basis vectors i', ... i? (x= » tal A), 


There exists a rectangular coor oe system Es. saga) 
with the basis vectors x!, .. ", forming an orthogonal 


basis (x= s E,x") and a system of Breal numbers 
k=! 


Aa, - 2+) An Such that in this system quadratic form (4’) 
is a sum of the squares of the coordinates & of the 
vector x multiplied respectively by the numbers i,: 


4 


ims 


Te 
Apilpt) = 2 AN s&3- (4”) 
s=1 ae 


w 


4 
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The transition from the left-hand member of (4”) to 
its right-hand side can be realized if the decompositions 
of the vectors x!, ..., x” in terms of the basis vectors 


vf, ..., i” are known. Let 


n 
x! = ps B;si* 
s= 


(see Sec. 17, (7), where a;,, a’ must be respectively re- 


placed by £;,, i’). Since i’, ..., i and x’, ..., x” are 
orthonormalized bases in R,, the matrix 
A= | Bis | 


is orthogonal. We regard that it is known. One and the 
same vector x can be decomposed according to two bases: 
nr nr 
x = >; fi = >; E ax! 
j=1 


s=1 


But then 
bd n n n n 
DY by = >) ED) By = DD Bysbs) f 
j=1 j=1 s=t s=1 j=l 


and, by virtue of linear independence of the system i’, .. . 
-, 1”, we get 


Tr 
t= 2 ByE; (s=41,..., 7). (11) 

j= 
Thus, the passage from-the coordinates &,, ..., &, to 
the coordinates z,;, ..., 2, 1s realized by means of mat- 


rix A* conjugate to A (i.e. with the aid of the rows of A* 
or columns of A). 

Substituting expressions (11) for z, into the left-hand 
member of (4"), we must obtain its right-hand mem- 
ber. Let us write this equality: 
ue aS 7 = nr nr nr nr 
> Dy ant Dy ByrEy >) Bubs = Dd) Dd) DD Byw (D) anBur) £45: 
k=4i=1 j= i=1 j=1 i=1 R=1 i=4 
a 2 As&s = a » Shik s5:, 
s=14 j=1 i=! 

0, jt, 


1 tj is Kronecker’s symbol. 
? ~~ 44 : 


where 6;,; = | 
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Kquating the coefficients of identical £;&;, we get the 
following equalities 


nr Tr 
2 B yn a an iby) = 5:4; (ti, J=1,-.., 0), 


which can be interpreted in the following way (see Sec. 15, 
(6)). For the matrix 


A = |[ an: || (Gat = an) 
of the self-adjoint operator A there exists an orthogonal 
matrix 
A = |! Bjs || 
such that 
A-A-At = Y, (12) 
where 7/ is a certain diagonal matrix 
A, 0 O... 0 
aI O kA, 0... O (13) 
0 0 0... An 


(A; real numbers) called canonical. 
Note that for the orthogonal matrix A 


At = A*, 


Since the determinants of orthogonal matrices | A | = 
= | A! |= +41, it follows from (12) that 


TT as= IU) IAL LAL A= TAL (4) 


We have proved, in particular, the following theorem, 

Theorem 3. If the determinant | A | of a self-adjoint 
matriz A is not zero (| A | #0), then oa of its ie Makai 
Ay, - + +, An are not equal to zero (Ay # 0,7 = 1, , n). 

It follows from Theorem 2 that 

(4) If A, 5>...>A, > 0, then the quadratic form 
is positive for any vectors £0, and, consequently, 
for any vectors x = 0. In this case it is called sirictly 
positive, | 
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(2) If O > dy Sg DS .. . S Aq, then the form is nega- 
tive for any § 0, hence, for any x = 0, and is termed 
Strictly negative. 

(3) If A, >... oh, and A, = 0, then the form is 
nonnegative. There exists a direction (axis &,), along 
which it is equal to zero. This is a positive form, but not 
strictly. 

(4) Tia, =O DSA, >... Shp, then the form is nega- 
tive, but not strictly. 

(5) If A, > 0, and A, <0, the form is indefinite. With 
the zero point eliminated, it is positive along the axis &, 
and negative along the axis &,. 

It turns out that judging by the form of matrix || A || 
and by the sign of some determinants generated by it, 
it is possible to find out whether its eigenvalues are all 
positive, all negative, or both positive and negative. 
This is just the essence of Sylvester's theorem*. 

Let us form a number of principal minors of the qua- 
dratic form (Ax, x): 


i, Ais 


ee Se eee 


2 A 
2i Qoo 


According to Sylvester’s theorem, the following state- 
ments are possible: 

1. If A, >0O, A, >0,..., A, > 90, then the form 
is strictly positive (Case 1). 

2. If A,<0, A,>O, Ag<0,..., (—1)"A, > 0, 
then the form is strictly negative (Case 2). 

3. If A, >0, A, >O0, ..., Ay >0 or Aj <0, A, > 
= 0, ..., (—1)" A, > 0 and there is j for which A; = 0, 
then the form is automatically not strictly definite. 

4, In all other cases the quadratic form is indefinite. 


* Sylvester, James Joseph (18414-1897). English algebraist, 
combinatorist, geometer, number theorist, and poet. Cofounder 
with Cayley of the theory of algebraic invariants (anticipated 
to some extent by Boole and Lagrange). 
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Sec. 23. Quadratic Form 
in Two-dimensional Space 


For n = 2 the quadratic form has the form 
2 2 
AyyL Aye y%q 4 AgyLy%y + Aged} 


zZ 
= UyyT, + 24, 9% ,Te ++ Agal,, (1) 


SINCE Ay. = Qo). 

In order to reduce form (1) to a sum of the squares of the 
coordinates of the vector (&,, €) in a certain basis (x!, x’), 
we have (see the preceding section) to find the basis vec- 
tors x!, x? which are the eigenvectors of the self-adjoint 
operator A generated by the symmetric matrix 


Ay Age 
A= 


Go, Gao 


Let us indicate another method for finding the eigen- 
values and eigenvectors of operator A, which is different 
from the method discussed in the preceding section. 

Thus, if A, is an eigenvalue of operator A and x® = 
= (c«, x) ~0 is an eigenvector corresponding to 
this eigenvalue, then 

Ax? — NgX°. 
Let us rewrite this equation in the coordinate form: 


(di, — Ap) ©” + Ayr.” = 0, 
Ay,0,” + (da2— Ap) xs” =Q, (2) 
or in the operator form 
(A — A,z) x® = 0, (2') 
where £ is an identical operator. 
Thus, homogeneous system (2) has a nonzero solution x° 


which is possible if the determinant of system (2) or (2’) 
is Zero: 


Asi — ho A142 

= |A—A,z| =0. 
Q419 Ba, — ho ! = 

Hence, the eigenvalue A, is the root of the equation 
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which is called the characteristic equation of the opera- 
tor A (or of the quadratic form (Ax, x)). 

The converse is also true. If A, is a root of equation (3), 
then the nontrivial solution of the system 


(A — AjE)x = 0 (4) 


will be the eigenvector of the self-adjoint operator A. 
Consequently, the eigenvalues of the operator A are 
found in this case as the roots of quadratic equation (3) 


(Q4, — A) (Qe. — A) — ai, = O, 
Ve — (Q4, + s9) N+ A434, — ai, = 0. 
Solving this equation, we get 


My = [ayy + ag,+ V 402, + (44— @g2)*I, 
ek (5) 

An = > [Q44 + Gon — V 4a?, + (44 —@oo)"], 

whence it is seen that Ay >A,, the equality Ay = A, 

being obtained for a,. = 0, a4; = deg. For the sake of 


definiteness, we shall assume that a,, > a., (otherwise 
we change z, for x, and zr, for z,). Then A, > a, (y — 


4 a a 
— a1 = 5| V 4a + (411 — 429)* — i — a0) | => 0). 
It follows from (5) that the eigenvalues of the self- 
adjoint operator A are real numbers. 
Now, knowing the eigenvalues A, and A,, we find the 


unit i me as the solutions of system (4). Since 


aa (A — nz) < 


If 4, = A,, then the matrix A — ie consists only of 
Zeros (Ay = Ag = Gy, = Qoo3 Gy. = 0), i.e. its rank is equal 
to zero. In this case the quadratic form is already reduced 
to a sum of squares (a,, = a2; = 0). System (4) satisfies 
any vector x = (z,, z,). Therefore, the basis vectors of 
the coordinate system x? = i = (1, 0), x?=j = (0, 1) 
may be taken as eigenvectors. Any other system (x!, x?) 
of orthonormal vectors possesses the following property: 
in this system the quadratic form consists, as before, 
only of squares. 
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Now, if A, >A,, then either a,, 40, or a, = 0, 
1,  &,,. The second case may be omitted, since form (1) 
is already reduced to a sum of squares. Hence, let a,,. = 0. 
Then 

rank (A — A,£) = 1. 


Therefore, it is sufficient to consider one equation of 
system (4): 

(Qy, — Ay) ty + yet, = 0. 
Hence we have (a,, + 0) 

Ly = [(—ay + Aq)/aye] 24. 


The vector 
Ay 414 
aie ws) 
is the solution of system (4). Normalizing this vector, 
we get an eigenvector 


4 (1) (1) y! 
x = (2; 7 2, )= ly | 


“(Fae EA a) 
4 {Awan \2 iy (eae) 
V 1+ G12 ) 42) t+/ ay 


2 


y= (24, 


By carrying out elementary transformations, it is pos- 
sible to obtain the following equalities: 


Sl nea 
gi? = m3 VE | 


x A+ (au —aua/V 405, + (ur ara) (244 — Agq) MV 4a2, + (a4, — dep)”, (244 — Goo), 
i eens + —— 


Ti 
«V1 (ay: — Oag)/V 402, F (Ang) f 


For further purposes it is sufficient to take the plus 
sign in formulas (6). 

Analogously, knowing the eigenvalue 4,, we shall find 
the eigenvector x?. It turns out that 


x = (w'?, 2?) = (—2), 2). 


(6) 
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Let us now form the matrix of the operator (of an orthog- 
onal transformation) A transferring the basis vectors 
(i, j) into the basis vectors (x’, x’) 
(1) 


2 


1) 
Ly 


oe ot 
A= ns 
2 


(the rows are formed by the coordinates of the age of 
the basis vectors i and j with the aid of A, x! = 
= 2Di + aj, 2 = —2 i + 2 j). Then ie ere 
nates of the vector me 5) in the system (i, j) are related 
with the coordinates (—,, €) of this vector in the system 
(x, x’) with the aid of the columns of the matrix A: 


Ly = to Sy = a 


By Se VE ae Es 


Substituting these values into quadratic form (4) and 
taking into consideration formulas (5) and (6), we get 


yt, + 2dyotyx2 + Agety = A Et + AED. (3) 


The right-hand member of this equality is called the 
canonical form of the quadratic form. 

If A, and A, are of the same sign, then the quadratic 
form is said to belong to the elliptic type; if A, and A, 
have opposite signs, then it belongs to the hyperbolic 
type; and if one of the numbers (A, or A.) is equal to zero, 
then the quadratic form belongs to the parabolic type. 

It is seen from (5) that A,A. = a ,@,. — a@j,. Therefore 
the type of form (1) can be determined by the sign of the 
expression @;@,. — @ig. 

The quadratic form will be elliptic, hyperbolic, or 
parabolic if the expression a4,d@,, — aj, is greater, less 
than, or equal to zero. 

Example 1. Reduce the following to the canonical form: 


(7) 


In this case a,,=—1, ea, Qo, =2. Since 


3 D : ; 
A449 —4;,=2—-Z= 7 >0, the given quadratic form 


will be elliptic. We find the eigenvectors and asso- 
1001496 
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ciated with them eigenvalues by formulas (5), (6): 


1 3 1 3 
5) 
Maz. 


Further, 
en (YA =): ae 


In the system (x!, x?) our quadratic form has the form 


ao? oF 
= Gy 5 Sa" 


: { n 3 
Since rY = = = cos(—T), pS = 
é JU F < . 
= sin ( — +] , the transformation with the aid of the 
matrix 
x" — 2p 4 (1) (1) 2 (1) (1) 
A=} a pari} t= (2, QP), X= (—2,, 24”) 
2 1 


means a rotation of the system (x,, z,) about the origin 
through an angle of a@ = x/3 clockwise (see Example 1 
in Sec. 16). 


Sec. 24. Second-order Curves 


In some rectangular coordinate system (z, y), let there 
be given a plane curve defined implicitly by a second- 
degree equation 


Ax’? + 2Bry + Cy? + 2D2+ 2ky + F =0, (1) 


where A, B, C, D, E, F are preassigned real numbers; 
A, B, C being not all zero. This curve is known as a 
second-order (or quadric, or quadratic) curve. However, it 
may happen that there is not a single point (z, y) with 
real coordinates satisfying equation (4). In such a case 
equation (1) is said to define an imaginary curve of the 
second order. We shall not study imaginary curves. The 
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equation 
ety? = —1 


may serve as an example of a second-degree equation de- 
fining an imaginary curve. 

Listed below are most important particular cases of 
the general equation (1): 

(1) The equation of an ellipse 


22 y2 
= +a =1 (¢bb>0) 
with the semiaxes of length a and b. In particular, for 
a = b it becomes the equation of a circle 
get y? = @? 
with centre at the origin and radius a. 


(2) The equation of a hyperbola 


2 2 
Hl @>b>0) 


with the semiaxes a and b. 
(3) The equation of a parabola 


y° = 2px (p> 0). 
(4) The equation of a pair of intersecting straight lines 
axe — by =0 (0O<a, bd). 
(5) The equation of a pair of parallel or coincident straight 
lines 
ve—a@=0 (ae 0). 
(6) The equation defining a point 
x? + y* = 0. 


Let us dwell briefly on the above listed curves. 
The ellipse 


+4=1 (a>b>0). (2) 


For a = b ellipse (2) turns into a circle of radius a 
with centre at the origin, i.e. into a plane curve con- 
sisting of all points at a given distance a, called the ra- 


10* 
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dius, from a fixed point in the plane, called the centre; 
or, in other words: into the locus of points at the distance a 
from the origin. 

Let a> b. We set c = V a? — B®. Mark on the z-axis 
points #’,, fF, having the abscissas « = —c and x =e, 


Fig. 36. 


respectively. These are the foci of the ellipse. Ellipse (2) 
can be defined as the locus of points, the sum of the dis- 
tances of which from the foci F,, F, is a constant equal 
to 2a. 

Indeed (see Fig. 36), 


MF.=V(e+ePty, MF,=V (c¢—cP-y, 
a= VET TE+V HPT, 
whence 
2a—V (x+ePtP=V (ce-ch +y¥ 
and 
4a? + (x +¢)?-+ y?— 4a V (x +0)? + y? = (x—c)? + y?, 
4a® + 4ea= 4a V (xe? + ¥, 
a*4+- 2a?ca + 2x? = a* (x? + 2ex +c? + y?], 
— b27? = — a*h* + ay’, 


a2b? = bz? + ary? 
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whence equation (2) follows. Following the above compu- 
tations in a reverse order, we obtain that if the point 
(xz, y) satisfies equation (2), then the sum of its distances 
from F, and F, is equal to 2a. 

If we replace x by —z in equation (2), then the latter 
will remain unchanged. This shows that ellipse (2) is a 
curve symmetric about the y-axis. Analogously, ellipse 
(2) is symmetric about the z-axis, since its equation 
remains unchanged when y is replaced by —y. But then 
it is sufficient to study its equation in the frst quadrant 
(of the coordinate system), that is for z, y > 0. The portion 
of the ellipse found in the first quadrant is defined by 
the equation 

ES | ae 
y=—Va —r, OXr<a. . 
We see from this equation that our ellipse passes through 
the points (0, b) and (a, 0). Its ordinate y continuously 
aes as x continuously increases on the interval 
0, al. 

The ellipse is a bounded curve. It is situated inside 
a circle of radius a with centre at the origin (for the coor- 
dinates of any point (z, y) of the ellipse the inequality 
et y< a (5 + Z) = a” holds true). 

From Fig. 36 we see that the ellipse is a continuous 
closed curve. In the first quadrant it is convex upward. 
A tangent to this curve can be drawn at any of its points*. 
All these and many other properties can be successfully 
studied using the methods of mathematical analysis which, 
furthermore, provides the means for exact definitions of 
the above mentioned notions, viz. continuity, convexity, 
etc. 

The equation of an ellipse can also be written in the 
parametric form: 


x=acos8, 

y=bsin 0 page caiea): (°) 
Indeed, 

= Y= cos?4-+sin?@=1, 


* See our book “Differential and Integral Calculus”, Sec. 4.2. 
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i.e. the point (z, y) defined by equalities (3) belongs 
to ellipse (2) for any 8. If @ runs continuously through 
the half-interval [0, 2), then the point (z, y) describes 
a complete ellipse. With a further increase in 6 the motion 
is periodically repeated. 

Let us clarify the meaning of 8 and indicate the method 
for constructing an ellipse (Fig. 37). Draw two concentric 
circles of radii b and a (b < a) with point O as centre. 
Then draw a radius vector at an angle 6 to the z-axis to 


Fig. 37. 


intersect the concentric circles of radii b and a at 7 and JN, 
respectively. Through the point N draw a straight line 
parallel to the y-axis, and through the point 7 a straight 
line parallel to the z-axis. The point of intersection of 
these lines belongs to the ellipse. Indeed, let z be the 
abscissa of the point M, and y the ordinate of this point. 
Then (see Fig. 37) 


zx = ON-cos 9 = acos8, 


y = TR = OT-sin 0 = 6 sin 0, 
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i.e. the point is really situated in ellipse (3) and the 
parameter 9 is the angle between the z-axis and the 
ray ON. Note that 0 is not the polar angle @ which is 
formed by the radius vector OM with the axis x (tan 9 = 


= tan 0). For instance, if 9 = 1/4, a = V3, b =1, 


then 0 = n/3; if g = 0, then 0 = 0; if » = n/2, then 
DS g/2. 
The hyperbola 


4 14 (Q<a, dD). (4) 


a2 be 
Let us set c = Va? + b? and mark on the z-axis points F, 


Vy yt 


24 ‘ 
N /, 
\ ‘sd m4 
+ 
: pA 


Fig. 38. 


and F, (foci of hyperbola (4)) having the abscissas x = —c 
and z = ¢, respectively (Fig. 38). 

Hyperbola (4) can also be defined as the locus of points 
A = (a, y), whose distances from the foci F, and F, 
have a constant difference equal to 2a. 

We have (see Fig. 38) 


AF, — AP, =V G14 VG P=, 
VieteP+y=V (ce? +y¥ +2a, 
(a +c)? y2=(2—e)?-+ y? + 4a Vee PP + 4a, 
4ex— 4a? = 4a V(2—cP +H, 
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c2x? — 2arcx + at = a? (x*— 2exr-+ c?) + a2y?, 
(a? + b?) 2? = ax? + a®b? + a2y?, 
bz? — g2y? = a2b?, 
whence equation (4) follows. 
We have obtained the right-hand branch of hyperbola 


(4). In order to get its left-hand branch, we have to pro- 
ceed from the equality 


AF, — AF, = 2a. 


Reasoning in a reverse order and proceeding from 
equation (4), we can conclude that the points (z, y) sat- 
isfying this equation belong to the above indicated 
locus. 

Judging by the form of equation (4), we conclude that 
hyperbola (4) is symmetric about the x- and y-axes. The 
portion of hyperbola (4) located in the first quadrant has 
the equation 


y= VP—@ ax<sr<0). (5) 


We see that our hyperbola passes through the point (a, 0), 
and with an increase in x on the half-interval [a, 00) 
the ordinate y increases and tends to infinity. The points 
B = (—a, 0) and C = (a, 0) at which the hyperbola 
intersects the z-axis are called the vertices of the hyperbola. 

Figure 38 shows two straight lines: 


b 
ae Ses 1S 


These are asymptotes of our hyperbola. 

Let on the half-interval [a, oo) (or (— oo, a]) there 
be given a curve y = f (x). The straight line y = mzr +n 
is said to be an asymptote of this curve as « —-+ 00 (« — 
— — oo) if 

lim [f (z)—mzx—n]=0 
X->-+ 90 
(respectively, lim [f(z)—mz—n]=0). 
x~—>—0o 

Let us consider the piece of our hyperbola defined by 

equality (5) and compare it with the straight line y = 
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; bz b i ae : b a? 

lim | =-— 2m — a | = lim — ——————- = 0 
sd i xoto % st VY 22—a? 

This shows that the straight line y = x is the asymptote 

of the piece under consideration as c —-+ oo. But then 

the straight line y = ae is said to be the asymptote of 


the (entire!) hyperbola as z —-+.oo. By virtue of sym- 
metry of our hyperbola about the axes, and the symmetry 


of the pair of straight lines y = +22 about the axes, 


we may say that both of these lines are asymptotes of 
our hyperbola as x ~-+- oo, and as  — — oo. 

The right-hand branch of hyperbola (4) can be repre- 
sented parametrically 


a 


“&—= a cosh u => (e"-+ e-"), 
; (—co<u<oo). (6) 
y=bsinhu= (e"—e™ 
Indeed, since 
cosh* u — sinh? u = 1, (7) 


we get from equations (6) 


xe y? : 
—- — + = cosh? u—sinh?u=1. 
a b? 


The upper half of the right-hand branch of the hyperbola 
corresponds to the variation of the parameter u € [0, 00), 
while the lower half to u € (—0o, OJ. 

Let us clarify how the parameter wu is related with the 
parameter 0 in the parametric equation of the ellipse 
and, al the same time, indicate how to construct a hyper- 
bola with the aid of compasses and aruler. Since our meth- 
od of constructing the hyperbola is based on the method 
of constructing the ellipse, we are going to present these 
two construction methods jointly (see Fig. 39). We shall 
confine ourselves to the construction of the parts of el- 
lipse (2) and hyperbola (6) situated in the first quadrant. 
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Draw two concentric circles of radii @ and b with the 
origin as centre. Draw the ray emanating from the origin 
at an angle 0, to the z-axis. Let 7, and NV, be the points 
of intersection of this ray and the concentric circles 
(OT, = b, ON, =a). Drawing through the points 7, 
and NV, straight lines parallel to the z- and y-axes, respec- 
tively, we shall obtain the point Jof their intersection 


Fig. 39. 


M,. = (Xo, Yo) belonging to ellipse (2). We then draw the 
ray OM,. Let Nz be the point of intersection of this ray 
with the circle of radius a, and P the point of intersection 
of this ray and the straight line parallel to the y-axis 
and passing through the point A = (a, 0) belonging to 
the ellipse. The equation of the ray OP can be written 
as follows 

Yo 

Zo 9 


y= 


Hence it follows that the ordinate of the point P is equal 
to Y, = <i . Further, join the points B, = (zy, 0) and N, 


by a diaioit line, and, through the point A, draw a 
straight line, parallel to B,N., to intersect the ray OP 
at the point Q. From the similarity of the triangles OAQ 
and OB,N, we obtain that OO = a?/z). Describing an 
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are of radius OQ from the origin as centre, we intersect 
the z-axis at the point B, = (a*/zxpo, 0). 

Now we draw, through the points B, and P, straight 
lines parallel to the y- and z-axes, respectively. The point 
of intersection of these lines My = (Xo, Yo), where 
X, = a*/xo, belongs to hyperbola (4). 

ine deed, since the point (%, Yo) lies on ellipse (2), we get 


Xé Ye. .-a* ayr a? ( yz )= a= x2 


“a2 =—Ssob2?”~—‘ia”*~*«‘aOSC 2 Ae 
a b are x2b x2 xe a 


i 


b2 
i.e. the point M7, belongs to hyperbola (4). 

Note that B, = (a*/xy, 0)is the point of intersection of 
the tangent to the ellipse at the point 17, with the z-axis 
(see the last footnote). 

Thus, to every point (z, y) of ellipse (2) there corre- 
sponds a quite definite point (X, Y) of hyperbola (4), 
and vice versa. 

Now, if ellipse (2) is represented parametrically, then 


x=acos0, y= bsin 0. 


Therefore, 


Ves 2 y=+ Y —bian0. 


x cos @ ? 


Hence, taking into account (6), we get 


cosh Gee , sinhu=tan0. 
cos 8 
The _. formulas also hold true: 
‘t—cos 6 / coshu—1 © We 
A 
1 _4+sin 0 
e“=coshu+ sinhu= Pe ies tan 6 = 50 
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1.e, 
IU 


u=In tan ( i ay 


d| 
— 


The parabola 
y= 2px (p> 0). (8) 
Let us mark on the z-axis a point F with the abscissa 
x = p/2 called the focus of parabola (8) and draw a 
straight line c = —p/2 termed the directriz of parabola (8) 
(Fig. 40). 


Parabola (8) can also be defined as the locus of points 


Fig. 40. 


A = (z, y) equidistant from the focus and directrix. 
Indeed (see Fig. 40) 


AF? =(2—4)"+¥, 
AB’ = ( x +4)’, 
consequently, 
(2) trte(2 +) 
— pxr+y? = pe, 
i.e. 
ar eee 


Conversely, it follows from this equation that the 
points, satisfying it, belong to the indicated locus of 
points. 
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As is seen from equation (8), parabola (8) is symmetric 
about the z-axis. Its upper half has the equation 


y=V 2px (0<r<oo), (9) 


wherefrom it is seen that when z increases, running 
through the half-interval [0, oo), the ordinate y also in- 
creases from 0 to oo. 

Let us indicate a simple method for constructing para- 
bola (9) with the aid of a ruler and a right angle, or with 
the aid of compasses and a ruler. Draw a straight line 


Fig. 41. 


z= —2p (Fig. 41) and take on it an arbitrary point 
K = (—2p, y), y > 0. Join this point to the origin and 
draw a straight line passing through the origin perpen- 
dicular to the line OK. Draw then a straight line through 
the point K parallel to the z-axis. The latter two lines 
intersect at the point M = (z, y) which belongs to para- 
bola (9), since OA = y is the geometric mean of the num- 
bers 2p and x (y = V 2pz). 

Parabola (8) has no asymptotes*. 

A pair of intersecting lines 


a’x? — b®y? = (ax — by) (ax + by) = 0 (O<a, B). 
(10) 


* See our book “Differential and Integral Calculus”, Sec. 4.20. 
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If a certain point (z, y) satisfies equation (10), then 
il satisfies one of the equations 


ax — by = 0, 


az + by =0 a) 
or both of them. Conversely, if the point (x, y) satisfies 
one of the equations (10’), then it also satisfies equation 
(10). In this meaning (10) is said to be the equation of 
a pair of straight lines. 

We shall prove below that there exists a rectangular 
coordinate system such that in it curve (41), provided 
that it is not an imaginary one, has one of the above 
listed equations (1) to (6). 

Namely: 

for AC — B* > O curve (1) is an ellipse, a point (Cases 
(1), (6)), or an imaginary curve; 

for AC — B? < 0 curve (1) is a hyperbola or a pair 
of intersecting (different) straight lines (Cases (2), (4)); 

for AC — B? = O curve (1) isa parabola, a pair of paral- 
lel or coincident straight lines, or an imaginary curve 
(Cases (8), (5)). 

We allow ourselves to use “curve” even in Cases (4), (9), 
(6) when we speak of a pair of straight lines or of a set 
consisting of one point. 

And so, let there be given an equation 


Az’? + 2Bry + Cy? + 2Dr+2Hy+ F=0, = (Af) 


where the coefficients A, B, C are not all zero. 

Without the loss of generality, we may regard that 
A>0, ASC, BS>O. We can always arrive to this 
situation with the aid of the orthogonal transformations: 


r=, t= —§, 
y=, y= 
and by multiplying both sides of (1) by —1. 


lfB =0, A S>C> 0), then (1) can be written in the 
form 


A(a+4)+C(y+2)+FP-Z-FH=0. tt) 
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The translation 
D L 
g=2+5, neytae 
transforms cquation (11) in tbe following way: 


2 
AB + Cy = 7+. (14’) 


fe D? : , 
If the number aay og ee; then equation (11’) 


represents the equation of an ellipse (Case (1)) with 
the semiaxes a, 6, where 


wn(Bt Fania bo( E42 ae 


Note that in this case AC — B? = AC > 0. 

And if the right-hand member of equation (11°) is 
equal to zero, then we get a point (Case (6)). 

With the negative right-hand member, equation (11’) 
yields an imaginary curve. 

If C =0, A >O, then equation (1) can he written 
in the form: 


A(a+ 4)’ + 2By4+F—-==0. (42) 


Let the number F£ = 0, then the translation 


D PF D? 
Batt, NV+ oF — TA 


transforms (12) into the equation 
AG? + 2ky = 0, (12°) 


which (on replacing, if necessary, y by —n) represents 
the equation of a parabola (Case (8)). 
li & = 0, then, depending on the sign of the number 
2 
— — F, we shall obtain either a pair of parallel straight 
lines, or an imaginary curve. Note that here AC — B* = 0. 
Further, if the number C < 0, A > 0, then equation 
(1) can be written as follows: 


A (2++)°-1¢l(y— fot) +B Apap — =O (13) 
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which is analyzed in the same way as equalion (11). 
Equation (13) yields either a hyperbola, or a pair of inter- 
secting straight lines (Cases (2) and (4)). Note that in 
this case AC — B* = AC <0. 

The case A = 0, C <0 is reduced to an equation of 
type (12). 

Hence, for B = O equation (1) is reduced to one of 
particular cases (1) to (6). 

Let now B > 0,A > C. Then, as we know from the pre- 
ceding section, there exists an orthogonal transformation 


x= £45 — YN, 


14 
Y=Yi8+ 7H, (14) 


where 


1 A—6 
ok V st 2V 4B? (A—C) ” 


| ee a ee 
41 2 2 V4B2+(A—Cy? ” 


which reduces the quadratic form 
Ax’? + 2Bay + Cy’ 


to the canonical form, 
Let us transform equation (1) with the aid of (14): 
Aye? + Agn? + 2D (216 — yin) + 2E (yiS + 20) 
where 
M_ => (440+ V 4B 4 (4—OF, 
dy => [A+ C—V4BF Ft (A— CP 


(Ay >> he, AqAy = AC — B*). We rewrite equation (15) 
in the form 


AE? + hen? + 2 (4D + y,£) E + 2 @E — yD) 
4F=0. (45 
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Equation (10’) is a particular case of equation (41) for 
B = 0, which was already analyzed. 

Thus, we may state that if: 

(1) AC — B* =i),A, > 0, then equation (4) represents 
either an ellipse, or a point, or an imaginary curve. In 
this case we shall say that equation (1) belongs to the 
elliptic type; 

(2) AC — B® = djhk, <0, then equation (1) represents 
either a hyperbola, or a pair of intersecting straight lines. 
In this case equation (1) will be said to belong to the 
hyperbolic type; 

(3) AC — B* = },A, = O, then equation (1) represents 
either a parabola, or a pair of parallel lines, or an imag- 
inary curve. In this case we shall say that equation (4) 
belongs to the parabolic type. | 

Example 1. Find out the character of the curve 


2° +V32yty?+22+2 V3y+F=0, 
where fF is an arbitrary real number. 
Here A=2>C=1, Bn V3 0, AC— B*=?>0, 
i.e. the equation belongs to the elliptic type. It is 


easy to compute (see the example in the preceding sec- 
tion) that 


nV 
——2 


4 5 4 
> hFKs, MHz. ho => 


4 7 


Therefore, with the aid of the orthogonal transformation 
1 
r==(V3§—y), 
4 a 
y= 5 (E+ V3 x) 
our equation will be written as follows 


24S ot (VSE—n) + V3 (EFVS n)+F=0 


\ 


or 
3 (Ete V3) +5 t2%=f-F. 


411—014496 
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In addition, let us realize the translation 


=| “ae 2, 
then we shall have 
5 1 16 , 
ey ee (16) 


lf ~_—F>0, then (16) will be the equation of an 
ellipse with the semiaxes a, 0, where 
a®*=2(16—9F)/25, 0%? =2(16—5F)/5. 


lf 2 _HF= Q, then (16) yields a point. If 2 —F<0, 
neh (16) represents an imaginary curve. 


Sec. 25. Second-order Surfaces in 
Three-dimensional Space 


The equation 


3 3 3 
2) 2s aumem +2.% Ati + B=0, (1) 


where a@;; = @1;,, A;, B are preassigned constants, and 
X = (%, ZY, Z3) is a variable point in R, defines, gene- 
rally speaking, a certain set of points in R, called the 
surface of the second order. If equation (1) is satisfied by 
none of real points x = (z,, %g, z3) then it is said to 
define an imaginary surface. However, we are not inte- 
rested in these cases. In some cases equation (4) may 
define a pair of different or coincident planes or only one 
point. But such sets will also be referred to as surfaces. 
Let us enumerate the most important particular cases 
of equation (1): 
(1) Ellipsoid 
re 

ae 


y? ge 
Sa a er (a, 6, c>0). 
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(2) Hyperboloid of one sheet 


2 3 2 
Sta a=t (a,b,c>0). 


(3) Hyperboloid of two sheets 
(4) Elliptic paraboloid 
£2 a y? —9 ( 0 
sg ee ee) 
(9) Hyperbolic paraboloid 
ce ae 
Dg ee (P,a > 9). 
(6) Cone of the second order 


2 2 2 
atS-Z=0 (@,),¢>0), 


(7) Point 
etyt2a= 0, 
(3) Cylinders of the second order: 
elliptic cylinder 
2 
S+4=1 (4,6>0), 


hyperbolic cylinder 


x2 


y? 
ae pr 1 (a, b>0), 


parabolic cylinder 
y= 2px (p>0), 
pair of intersecting planes 
272 — b*y® = 0 (a, b >0), 
pair of parallel or coincident planes 
z’—a=0 (a>0), 


Zo =), 


(a,b, c>0). 
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straight line 
gee == 0, 


When considering the particular cases of equation (4) 
we regarded that = 21, Y = 2%, 2 = 23. 

It is possible to prove that for each particular equation 
(1), provided that it defines a real (not an imaginary}) 
suriace, we can find a rectangular coordinate system in 
which this equation has one of the above listed eight 
forms. This follows from the general theory set forth in 
Sec. 22, The transformation of equation (41) itself is car- 
ried out in the same way as in the preceding section. 
The finding of the eigenvalues A,, A., Az is reduced to 
solving a cubic equation. 

We would like to indicate one more way of finding 
eigenvalues and eigenvectors, which, as a matter of fact, 
was considered in Sec. 23 for the two-dimensional case. 
The eigenvalues A,, Ag, Az of the self-adjoint operator A 
and associated with them normalized vectors x!, x’, 
x8 (jx? [= 41, Ax’ = Ax’, j = 1, 2,3) can be found 
in the following way (the proof is given below). 

We introduce the determinant (dy; = aj) 


Gyy—h Ai 413 
D (k)=|A—AE|= oq Aog—h Qo3 |, 
Q34 239 A33—A 


where F is a unit matrix. Find the roots A,, Ae, Ag of the 
equation 


|A —AB| =0, (2) 


which is called the characteristic equation of the operator 
A (D (Aj) = 0, j = 1, 2, 3). These are just eigenvalues 
of the operator A. Being real, they may be different, but 
may also coincide, i.e. be multiple. Hence, 


D (A) = (Ay — A) (rn — A) (Ag — Ap 


Sec. 25. Second-order Surfaces 165 


Then for the root 4, we look for the nontrivial solution 
KX = (21, 22, x3) of the homogeneous system of equations: 


(O44 — Ay) 2+ Ayo2 + Q1305=0, 
Oyo + (Beg — Ay) He-+ Ap,t, = 0, (3) 

Bay Ly TP Age + (d33 — Ay) 2g =D 
or of the corresponding equation for the operator A — A,E: 
(A —i1,E) x = 0. (3°) 


where F is a unit matrix and x = (2, 22, %3), 0 = 
= (0, 0, 0) are vectors. 

llf A, is a simple root (i.e. in this case A, is different 
from 4. and from A,), then the rank of the matrix of 
system (3) will be necessarily equal to two (rank (A — 
— 1,E) = 2) and we shall get the only (up to a sign) 
vector xt = (7, x, a$?), satisfying system (3), i.e 

(A = MF) x! = 0 
Ax! = ),x". 


If A, is a root of the second multiplicity (A, = 4. <& As), 
then system (3) necessarily has the rank equal to unity 
(rank (A — 4,F) = 1), and will have two orthonormal 
solutions x! and x? ([x!] = ]x?]—=14, (x', x?) = 0) 
which are eigenvectors associated with the eigenvalue Aj? 

Ax? = A;xi (fF = 1, 2, 4, = A). 


Finally, if 4, is a root of the third multiplicity (A, = 
=A, =A), then system (3) necessarily has the rank 
equal to zero (rank (A —4,F) = 0), and will have 
three orthonormal solutions x!, x?, x?: 


Axi = Asx? (j = 1, Zz os Ay =a Nes == As). 


Any three orthonormal vectors in R; may be taken as 
eigenvectors x!, x?, x? belonging to the eigenvalues 4, = 
— ho Vw 

Let us prove this. We know that in AR, there exists a 
system of orthonormal vectors x!, x”, x3 and real numbers 
Ay, Ag, Ag such that 


Avi=Ap? (fF = 14, 2, 3), 
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And we can indicate an orthogonal matrix A such that 
(see Sec. 22) 


hm, 0 0 
AAA“A=|10 2, 0 
0 0 As 


Hence, for a variable number A the following identity 
holds true 


Ay—A 0 0 
A(A—AE)At=]}, 0 Ky— 0 (4) 
0 0 A,g—A 
(A (A—AE) A? = AAA*— AXEAA= 
— AAA1— AAA? = AAA'S—ALE). 
Above, we denoted the determinant of the matrix A — AE 
by D (A). As it is seen from (4), it is equal to the deter- 
minant of the matrix standing in the right-hand side 
of (4), since 
(hy — 2) re — 2) (hp — 2) = | A (A — 2B) A+ | 
= |A|l[A—AF][ AT = |A—AE| =D (A). 
Thus, 
D (4) = (Ay — A) (Ae — A) (Ag — A). 


We have obtained that the roots of the polynomial D (A) 
coincide with the eigenvalues A,, Az, Ay of the operator A. 
Hence, they are real. 

Let A, be a simple root and, consequently, A, = A, 
and A, - As. Then the matrix in the right-hand side of (4) 
has for )} =A, the rank equal to two (rank A (A — 
— ,E) At = 2), but then rank (A — A,£) = 2. As a 
matter of fact, the solutions of homogeneous system (3) 
and the system 


O24 + (Ag — Ay) 29 + 0-23= 0, 
+24 4 0-2, ++ (Ag — M4) Z,= 0, 


0-z,+0-z,+0-2,=0, 
6) 
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corresponding to matrix (4) are transformed into each 
other with the aid of an orthogonal matrix (systems (3) 
and (5) are equivalent). But system (5) has only one (up 
to a sign) normalized solution (+14, 0, 0). And this 
is possible only if rank (A — A,E) = 2. 

It is possible to give the following explanation of this 
fact. If we assume that all second-order determinants 
generated by the matrix A —iA,F are equal to zero, 
then all the second-order determinants generated by the 
matrix A (A —A,E) A“, will also equal zero, since 
these determinants are linear combinations of the second- 
order determinants from the matrix A — A,#. But this 
is impossible, since the determinant generated by the 
matrix A (A —iA,E) At 


hea—™, OO : 
e- Sele 


If now Ay = A, Az, then, reasoning in a similar way, 
we shall obtain that the rank (4 — A,#) = 1 and, hence, 
there exist exactly two orthonormal solutions x!', x? 
of system (3) corresponding to A, = Ag. 

Finally, for 4, =A, =A, rank (A —- A,E) = 0, i.e. 
all the elements of the matrix A — A,£ are zero. In this 
case any vector x = (2, Zo, X3) isa solution of system (3). 
This results in that any three vectors x!, x’, x? forming 
an orthonormal system will be the eigenvectors belonging 
to the eigenvalue A, = A, = Ag. 

Note that in this situation the quadratic form isalready 
reduced to the sum of squares (a,5 = @13 = G3 = O, 
Qi, = Aga = A33 = Ay). 

Example t. Reduce the following to the canonical form 


wt + xo + Qt, + 2xyrg + 2X9Ky. 


Here Q14 = Beg = Ax ial Q13 — Qo3 >= t, 433 => Q. Let 
us set up a characteristic equation 


1—A 1 4 
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or 
—A(4—-AP+2—-2U—AN+1=0, 
—i (A — 0)? + 34 = 0. 


It is easy to see that A = 0 is a root of this equation. 
Let us find two other roots: 


3—(1—aA®>=0, “4—A?=—3, 1-A=4YV3, 
=1+/YV/3. 
Hence, 


Me =1+V3. Ag=0, A,=1-—V3, 
i.e. we have obtained the case: i, > A, > As. 
Let us find the eigenvector x'. To this effect, we set up 
system (3): 
ca V3 t+ Le tz =0 
ty—V 382,+ tz =0 
t+ 2,-44+V)V3)2,=0. 


Any two equations of this system are linearly independent. 
Solving the system of the first two equations, we get 


44+ y3 itV3 


Ly 
Thus, the vector 


1+V3 1+V3 _ 
yin(ttVS,,, HVE 4, 2) 


is a solution of the system and, normalizing it, we 
get the eigenvector 


tan ( HVE _igvi ty 
yl \eV34 V3’ 2Vary3° V3+V3!" 
Let us find x? (A. = 0): 
Go ae: ay 0 
Bote a3 0 


t+ 2, + 0x, = 0, 
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Solving the system of the last two equations (since the 
determinant made up of the coefficients of z, and zz 
is not equal to _2eF0), we obtain %, = —%, %3 = 0. 

The vector y® = (2, —z,, 0) is the solution of the 
system and the vector 


is a unit eigenvector. It is easy to see that it is orthog- 
onal to x! (the scalar product of these vectors is zero). 


Finally, for 4, = 1 — V3 we find that 
fac Greer. a co ey - te 
V 64273 V642V3 V6+2V3 
is the third eigenvector. 
The orthogonal matrix for passing over from the coor- 
dinates of the vector x = (%,, 2., 73) in the system (i, 
j, k) to the coordinates of the vector x = (&, &,, &) 


in the system (x', x”, x°) has the form 


6 8) (I> (LD 
ay Xe Ls 


Az sa ve Le 
a? e® a 
1 2 
(JA}]=4, Aut (i)=x!, A-t(j) =x?, A‘ (k) =x), i.e. 
HeSsy bye, Se 2, Sa; 
Lo = ey a3 we So Ss Leda oe (6) 
Ly = 5°, + 03" Eg + 25° E3- 

This transformation retains the orientation (since 
| A | = 12> 0), ie. the system (x!, x?, x°) is oriented 
in the same way as the initial system (i, j, k). 

Substituting, instead of z,, their values expressed by 
formulas (6) into our quadratic form, we get its ca- 
nonical form 


(14+V3)8+(1-V 3). 
Now we would like to present a more detailed study of 


the equations and the corresponding surfaces of the above 
indicated eight types. 
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The ellipsoid 
ets Ve at Gb, oo 0), (7) 


a2 | pb? ' ¢ 

For a = 6 = ellipsoid (7) turns into a sphere of ra- 
dius a with centre at the origin, i.e. into the locus of 
points lying at a distance a from the origin. 

The quantities a, b, c are called the semiazes of the 
ellipsoid. 

If in equation (7) we replace (simultaneously or sepa- 
rately) z by —z, y by —y, z by —z, then the equation 
remains unchanged. This shows that ellipsoid (7) is a 
surface symmetric both about the coordinate planes 
zx=0, y=0, z=0 and the origin. Therefore, it is 
sufficient to study the equation of ellipsoid (7) in the 
first octant (of the coordinate system), i.e. for z > 0, 
y > 0, z>0. The portion of the ellipsoid located in 
the first octant is defined by an explicit equation, for 
instance, 

Y4 Oe zx? y2 
z=c] eer aa ke x0, y 0, Sa pe 
For the sake of definiteness, we shall assume that a > 
> b> c. The ellipsoid is a bounded surface. It is found 
inside a sphere of radius a with the origin as centre: 
the coordinates of any point (x, y, 2) of the ellipsoid are 
related by the following inequality: 
etytre<a@ (= +-£+4) = g?-{ —q? 
va a b2 ' G8 eae: 

In order to form a more exact notion about the ellip- 
soid, let us intersect it by cutting planes parallel to 
the coordinate planes. For instance, cutting the ellipsoid 
by planes z = h (—c <h<c), we shall intersect it in 
ellipses 

x2 2 h2 
ae la ee 


with the semiaxes 


ae a — 
i) i, bj 1. 
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Hence it is seen that the greatest ellipse is yielded by 
the cutting plane z = Q. A similar result will be obtained 
by cutting the ellipsoid with the planes 7 = h (—a <A 
<a), y=h(—b<h<d). 

Ellipsoid (7) has the form represented in Fig. 42. 

The points (a, 0, 0), (0, +b, 0), (0, 0, +e) lie on 
ellipsoid (7) and are called its vertices. 

If any two semiaxes are equal to each other, then el- 
lipsoid (7) will be the ellipsoid of revolution, i.e. an ellip- 
soid generated by rotating an ellipse about one of the 
coordinate axes. 

The hyperboloid of one sheet 

2 2 2 
—+4-4=1 (a, b, c>0). (3) 

Judging by the form of equation (8), we conclude that 

the hyperboloid of one sheet is a surface symmetric both 


Fig. 42. 


about the coordinate planes and the origin. The quan- 
tities a, b, c are called the semiazes of the hyperboloid of 
one sheet. The points (+a, 0, 0), (0, --b, 0), lying on 
the surface of hyperboloid (8) are called the vertices of the 
hyperboloid of one sheet. 
Cutting surface (8) by a plane z = h, we intersect it 
in an ellipse 
v2 2 h2 
F414 oo 


q2 b2 
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with the semiaxes 


h? : h? 
a es 
With 2 varying from —oo to +00, this ellipse describes 
surface (8). 
Intersecting now surface (8) with a plane x =/h (or 
y =h), we shall cut off a hyperbola 
Di sa ee (5 -4=1-=) 


“B22 ae 
For h = -ta the first hyperbola decomposes into two 
straight lines y = +2 Z. 


If | | <a, then the real transverse axis of symmetry 
of the corresponding hyperbola is a straight line parallel 
to the y-axis, and for | 2 | > ait isa straight line parallel 
to the z-axis. 

The real (transverse) axis of symmetry is defined as one 
of the axes of symmetry which is intersected by the hyper- 
bola. 

If a = b, then planes z = h will cut surface (8) in 


circles of radius a1 ++ (h2/c?). In this case surface (8) 
2 2 
is generated by rotating the hyperbola _ — = = 1 about 


the z-axis. The general view of a hyperboloid of one sheet 
is given in Fig. 43. 
The hyperboloid of two sheets 


a bb (a, b, c>0). (9) 


Since equation (9) contains only the squares of the 
variables, the given surface is symmetric both about the 
planes x = 0, y = 0, z = O and the origin. 

Let us now write equation (9) in the general form 

g2 


2 2 
tos 14+. (9’) 
Hence it is clear that, cutting surface (9’) with a plane 
z=h(|h |S), we shall obtain an ellipse 
y2 32 


eh2 
_ fi 
oe ge ae. 
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with the semiaxes 
bY (h/a®\)—1, cV (h2/a2)—1. 

For | 2 | < a the number (h?/a?) — 1 <0, and, therefore, 
there are no points of intersection of surface (9’) and 
the plane z = Ah. 

Cutting surface (9) with planes z = h (y = h), we get 
hyperbolas 

x y? h2 2 2 h2 

3 —-#=14+4(@-F=!1+ w). 


q2 


The points (--a, 0, 0) lie on surface (9) and are called 


Fig. 43, Fig. 44, 


the vertices of the parted (two-sheeted) hyperboloid. Sur- 
face (9) is represented in Fig. 44. 
The elliptic paraboloid 


ie oe ee aa 0) 10 
p qg om P, q> ° ( } 


Since equation (10) contains the squares of the vari- 
ables z and y, we conclude that the given surface is sym- 
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metric about the coordinate planes « = 0, y = O. Fur- 
ther, since we hold that p, g > 0, surface (10) is situated 
in the half space z > 0. 

Cutting surface (10) with planes z=h(h>0), we 
shall intersect it in ellipses 


with the semiaxes 
V2ph, V 2gh. 
As h varies from zero to oo, these ellipses describe our 
surface (10). 
Cutting surface (10) with planes z = h (or y = h), we 
intersect it in parabolas 


v= 2g (2-35) (2*=2n(2—F)) 


‘ ‘ h2 h2 
with a displaced vertex at the point ary (2=5). 

For p = q surface (10) will be a surface of revolution 
generated by rotating the parabola x? = 2pz about the 
z-axis. In this case surface (10) is 
called the paraboloid of revolution. 

The point (0, 0, 0) lies on sur- 
face (10) and is known as the ver- 
tex of the elliptic paraboloid. The 
elliptic paraboloid is represented 
in Fig. 45. 

The hyperbolic paraboloid 


we -L=2% (p,q>0). (11) 


Judging by the form of equation 
(14), we conclude that this surface 
Fig. 45. is symmetric about the planes x = 
—0,y —0. Cutting surface (11) 


with planes z = h, we obtain hyperbolas 


Sec. 25. Second-order Surfaces 175 


where for 2 > 0 the real axis of symmetry of the hyperbola 
will be parallel to the z-axis, and for kh < 0 tothe y-axis. 
For kh = 0 we shall get a pair of intersecting straight 


lines. 
When cutting surface (141) with planes z = h or y = h, 
we shall get parabolas with their branches directed down- 


wards or upwards: 


Surface (11) is depicted in Fig. 46. 
The cone of the second order 


2 2 2 
t+ a= 0 (a, 6, e>0). (12) 


It is clear that the given surface is symmetric both 
about the planes z = 0, y = 0, 2 = O and the origin. 


Fig. 46. 


When intersecting surface (12) with planes z = h, 
we Shall obtain ellipses 
z2 y2 h2 
aoe =e 
with the semiaxes a|h |/e and b] h |/e. 
And if surface (12) is cut by planes t = h or y =A, 
then it will be intersected in hyperbolas 
22 y? h2 z2 x h2 
ear ae gre) 
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Returning to (11) and intersecting it with planes y = hz, 
we shall get a pair of intersecting lines 


z= +exrV (1/a®) + (A°/0). 
The cone under consideration is shown in Fig. 47. 
The point 
ey? 2 =: (13) 
This equation is satisfied only by one point: z =y=z = 0. 
Cylinders of the second order: 
(a) The elliptical cylinder 


= +4 =1 (a, b>0). (14) 


Equation (14) does not contain the variable z. On the 
plane zOy, equation (14) defines an ellipse with the semi- 
axes a and 0b. If a point (a, y) lies on this ellipse, then 


Fig. 47. Fig. 48. 


for any z the point (z, y, 2) lies on surface (14). The set 
of such points is a surface described by a straight line 
parallel to the z-axis and intersecting the ellipse 


Ela Oe eee, 


in the plane zOy. 
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Ellipse (14) is called the directrix of the surface in ques- 
tion, and all possible positions of the generating line 
(generatrix)—the elements. 

In general, a surface generated by a straight line which 
remains parallel to a certain given direction and inter- 
sects a given curve ZL is called cylindrical. Surface (14) 
is shown in Fig. 48. 

(b) Hyperbolic and parabolic cylinders 


xr2 


2 
—--4=1 (a, b>0), (19) 


y° = 2px (p > 0). (16) 

In this case the directrices are a hyperbola and parab- 
ola, and the elements are straight lines parallel to the 
z-axis and passing through the hyperbola and parabola in 
the plane zOy. 

Surfaces (15) and (16) are represented in Figs. 49 and 950, 
respectively. 

(c) Parallel and intersecting planes. A straight line 


ay — by? =0 (a, b> 0), (17) 
z—_a=0 (a>Q0), (18) 
Ze), (19) 
be | aes 8 (20) 
For surface (17) the directrices are straight lines 
Y= oe — a; 


Therefore, surface (17) is a pair of intersecting planes. 
The equations of both surfaces (18) and (19) are void of 
two coordinates each. Equation (18) in the plane zOy 
is a pair of straight lines x = La. 

If we shall take x = +a and any y and z, then the 
points (a, y, 2) will satisfy equation (18), therefore 
surface (18) is a pair of parallel] planes. 

Equation (19) describes the plane zOy, since this equa- 
tion is satisfied by any points of the form (zx, y, 0) the 
entire set of which just constitute the plane xOy. 

It is also possible to regard z = O as a directrix in one 
of the planes xOz or yOz, where the elements are straight 


42—01496 
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Fig. 50. 


lines parallel either to the y- or z-axis and passing through 
the straight line z = 0. 

Equation (20) is satished by any point with z = y = 0 
and any z. Therefore, this equation defines a straight line, 
namely, the z-axis. 

Ruled surfaces. 

Some surfaces of the second order are generated by a 
moving straight line. Such are all cylindrical surfaces 
and a second-order cone. But there are other surfaces 
which are also generated by a moving straight line. 

A surface that can be generated by a moving straight 
line is called a ruled surface, and the straight lines en- 
tirely lying in it are termed rectilinear generators. 

The hyperboloid of one sheet and hyperbolic paraboloid 
belong to the ruled surfaces. 
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The equation of hyperboloid of one sheet (8) can be 
written in the form 
psp ie) (ee 
a c b2 


or, factorizing both sides of the equation, we get 


(E+4)(£-2)=(14 $) (1-4). eh 


Let us compose a system of the first degree: 


c Z y 
Feri Come 

7 B.A ry (22) 
£-$=$ (1-4). 
where *& is an arbitrary parameter. 

For a definite value of the parameter & we obtain a 
straight line, thus, as & varies, we get a family of straight 
lines. Multiplying equations (22) termwise, we shall 
obtain equation (21) of our surface. Therefore, any point 
(z, y, 2), satislying system (22), is located on surface (21). 
Consequently, each line belonging to family (22) lies 
entirely on the surface of the hyperboloid of one sheet. 

Quite analogously, the system 


x Bo ye 

a | 8 ), 
x a 24 y 
Paria. Geo 

where / is a parameter, also defines a family of straight 

lines, which is different from family (22) and belongs to 

surface (21). 

Through every point of hyperboloid (21) there passes 


one straight line of either family, generally speaking, for 
different values of the parameters y — L (Fig. 51). For 


instance, through the point (V/ 3a, ¥2 =~ b, c} of surface (21) 


there passes a straight (22) a k= (2 + V 6)/ /(2 + 
+ V 2) anda straight line (23) for 1 = (2+ V6)/(2 — va 

The ruled-surface nature of a hyperboloid of one sheet 
was utilized by the Soviet engineer V. G. Shukhov (1853- 


{2* 


(23) 
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1939) in the construction of what is called the “Shukhov 
Tower” of Moscow (in the Shabolovka street) which for 
years was used as the Moscow radio and television tower. 
It was constructed out of steel strips arranged along rec- 
tilinear generatrices of a hyperboloid of one sheet. The 
strips were riveted together at the points of intersection 
of the two systems of generators. Shukhov’s structure 
possesses high strength, though a relatively small amount 
of material was used in the construction. Various tall 
towers have been constructed 
using Shukhov’s method. 

It can be easily verified that 
the two families of straight lines: 


6 ot 
+ —¥_ = 2kz, 


Vp Va 

os WS ey) 
Vp. va 
ee 

Me . (25) 
Ve Ve” 


generate the surface of hyper- 
bolic paraboloid (11). 

The straight lines belonging 
to families (24) and (20) lie on 
this surface and, conversely, any 
point of this surface is the intersection of a certain 
straight line of family (24) anda line belonging to family 
(25). 


Sec. 26. The General Theory of a Second-order 
Surface in Three-dimensional Space 


Let there be given a surface of the second order 


3 3 3 
p2) 2, ApILp ti + 2 ba Ajz;+ B= 0, (4) 


where d,1 = @,, Ai, B are constant numbers—the coefhi- 
cients of the equation. 
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The preceding section described eight forms ((1) to 
(8)) (particular cases) of equation (1) and it was noted there 
that for every given equation (1), provided that it defines 
not an imaginary surface, it is possible to find a rectilinear 
coordinate system in which this equation has one of the 
indicated forms. 

This statement is proved below. 

To begin with, we consider the quadratic form standing 
in the left-hand side of equation (1). 

On the basis of Theorem 2 of Sec. 22, this form can be 
reduced, with the aid of the orthogonal transformation 


3 
z,= 2) Bi sXs (i= 1, Z 3), (2) 
s=1 


to the following form: 


3.3 
\1 a , fo 9 


=| t= 


where A,, Ao, 43 are definite real numbers. 

We should like to underline that equalities (2) define 
the transformation of the initial rectangular coordinate 
system (4, Zo, Z3) to some other rectangular system 
(x1, Z,, ,). The point, whose coordinates in the initial 
system is tn, Zo, Z3), in the new system has the coor- 
dinates (z/, x,, x3) obtained by inverting operation (2). 

Tn the new rectangular system our surface has, obviously, 
the following equation: 


Nyts? + gas? + Agog? + 2 SI S Anh +B=—0, (3) 


where A; are some constant numbers. 

Let us first consider the case when all the three num- 
bers are different from zero (A; #0, i = 1, 2, 3). 

In this case we transfer the coordinate system x), z,, 
x, so that its origin goes into the point (a,, @,, as); we 
then obtain a second rectangular coordinate system (&, 
Ee, &3), where 


m=a+8 & =1, 2, 3). 
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In it the equation of our surface has the form 


As (4 +1)? + Ag (€g + &)® + Ag (23+ €3)? 


3 
+2 3) Ak (an + &)+B=0 


or 
3 
Mert Mabe + Ass + 2 21 (aids + Ai) &: + Bi= 0, 
where B, is some constant. Setting 
a. 24.8 
rn Oe ( —= 1, G, ), 


we simplify this equation: 
AS; =i Asks + A3&5 = —B)\. (4) 


Suppose the numbers A,, 4., A3 are of the same sign. 

If in this case B, = 0, then equation (4) is satisfied only 
by one point, namely, by the zero point (0, 0, 0) (see 
(7) in the preceding section). 

If B, ++ 0 and has the same sign as the numbers Ay, 
he, Az, then, obviously, there are no points with real 
coordinates which satisfy equation (4). In this case sur- 
face (1) is imaginary. 

And if B, = 0 and has the sign opposite to that of the 
numbers A,, As, As, then equation (4) can be written in 
the form 


ee a ee 
— By + — By { = B, | (9) 
An he Ag 
or, setting 
B B B 
2 #1. / ne oe , eee race 
a = Ay 9 5 Ao ’] C Ag ’ 
in the form 
es ee 
a ae get ee 
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Hence, surface (1) is an ellipsoid (see (1) in the preceding 
section). 

Let now two of the numbers A,, A., A3 be of the same 
sign and the third have an opposite sign. 

If B, = 0, then in equation (4) we may regard that 
A, > 0, A, > 0, A, <0, multiplying (4), if necessary, 
by —4 and interchanging, if necessary, &;. Then, making 


1 1 { 
Maa hg =e Ag= — (a, b, c>0), 


we shall get the equation of a cone (see (6) in the preceding 
section) 


For B, ~ 0 we take advantage of formula (5) once again. 
We distinguish here two essentially different cases: 


1, 148 8 4 (ayperboloid h 9 
re ns a ae yperboloid of one sheet, see (2) 


in Sec. 25), 
9, 8 4 (nyperboloid of two sheet 3 
a amiss wan ca yperboloid of two sheets, see (3) 
in Sec. 25). 


The remaining cases are reduced to these two by the 
corresponding replacement of the coordinates &,. 

Let now A = A,A,A, = 0. Then at least one of the 
numbers A,, Ag, Az is equal to zero. We shall regard that 
4, = 0, interchanging, if necessary, &,. 

So, let A, = 0. We first single out the case when A, = 0. 
Then equation (3) has the form 


Myr? + Agr? + 2 (Alix, + Apr.) + B =O, 


where 4,, A,, Aj, A,, B are any numbers. 

In the plane (z{, x,) this is the general equation of 
a second-order curve. In space (xj, x,, x3), it is the 
equation of a cylindrical surface (see (8) in the preceding 
section) passing through a plane second-order curve with 
the generatrix parallel to the axis x, (see (8) in the pre- 
ceding section). 
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Further, we shall always hold that A, 40 and A, = 0, 
and then equation (3) has the form 
(Ayxy? + 2A ji) + (hon? + 24521) 

+ 2A;32,+B=0. (3’) 

Here, we distinguish the following: essentially differ- 
ent cases: (a) Ay, A, > 0; (b) A, > 0, A, << O; (c) A, > O; 
A. = 0. Other cases are reduced to these by replacing 
the coordinates or multiplying by —1. 

Consider Case (a) 4,, 4. >> 0. Take the factors A, and A, 
out of the parentheses and complete the expressions between 


these parentheses to perfect squares. Then, taking into 
account that A,, A., A; are different from zero, we shall geu 


Ay (2 + &)? + Ag (1, + B)? + 24; (@, + y) = 0, 
where a, 8, 7 are relevant numbers. Setting 
E=2a,+a, y= 2,-+ 8, C=2,+y, 


we get 
Ay + hen? = —2A56 
or 


2 2 


If —A; > 0, then equation (6) has the form 
S44 a2 (elliptic paraboloid, see (4) in 


Sec. 25) or (7) 
P, @q . 


And if ~-A, <0, then replacing ¢ by —£, we get once 
again the equation of form (7), i.e. an elliptic paraboloid. 

Consider now Case (b) A, > 0, A. <0, (A; 0). We 
take advantage of equation (6). If 4,;< 0, then this 
equation is written in the form 


Tax (hyperbolic paraboloid, see (5) in 
Sec. 25) (3) 


(p, g >0). 
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And if A; > 0, then, on interchanging € and yn we 
shall get again the equation of form (8), i.e. a hyperbolic 
paraboloid. 

Finally, we pass over to Case (c) A, > 0, A, = 0 (A; + 
+ 0). Then equation (3’) is reduced to the following: 


(Ayay" + 2A 521) + 2 (Aja, + Ajz;) + B= 0. 


Factoring A, out of the first parentheses and supple- 
menting the expression between these parentheses to a 
perfect square (taking into consideration that 4,, A, #0), 
we obtain 


Ay (x, + a)? + 2A jr) + 2A, (x, + 8) = 0, 
where a, 6 are some numbers. On substituting 
E=-a,+a, n=sz, €=2,+ 8, 
this equation will turn into the following: 
AS? + 2 (Asn + A536) = 0. (9) 


Consider in the plane (yn, ) the vector = (A,, A). 
Let us write it in the form 


(A;, A;) =e (cosa, sin a), 


where p > 0 is the length of , and (cos a, sina) isa 
unit vector in the direction of m. The second vector 
(sin a, —cos a) is also a unit vector, it is perpendicular 
to the first one. 

Let us introduce in the plane (y, &) the following ortho- 
gonal transformation: 


u=ycosa+Csina, v=ysinae — Ccosa. 
By means of this transformation we replace the rectangu- 
lar coordinate system (ny, ¢) by the rectangular coordi- 
nate system (u, v), and the rectangular coordinate system 


(E, , €) by the rectangular coordinate system (&, u, v). 
As a result, equation (9), which can be written as 


8? + 2p (n cosa + Esin a) =0, 
takes the following form: 


Ay5? = 2 pu = 0, 


Text 
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or, substituting —w for u, 
Be = 2pu (p = p/h, > 0), 
or, finally, interchanging € and uw, 
u*= 2p— (p> 0), 


i.e. we have obtained the equation of a parabolic cylinder 
(in the rectangular coordinates (€, u, v)). 

Thus, we have considered all possible cases for equa- 
tion (1) and in each of them have found the rectangular 
coordinate system in which equation (1) has one of the 
forms listed in Sec. 25 ((1) through (8)). The statement 
has been proved. 
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